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Never identify any mathematical model 
with reality. 


—THE AUTHORS 


Preface 


Einstein’s General Theory of Relativity represents a very complex theoretical 
discipline which intersects mathematics and physics. Its results are applied in 
relativistic cosmology, which is a branch of astronomy that deals with the greatest 
spatial and temporal distances. For some time there has been a growing crisis in 
cosmology. The contemporary cosmological model possesses at least 20 serious 
paradoxes, see e.g. Sect. 1.1. Nevertheless, it is still defended by most cosmologists. 
We show that the main reason for this is that the simple mathematical models used 
in cosmology are identified with reality and that various incorrect extrapolations are 
being made. 

In 1999, the first author of this monograph published the paper [157] about delay 
differential equations: Numerical experience with the finite speed of gravitational 
interaction which indicated that the laws of conservation of energy and angular 
momentum can be slightly violated in our universe. This was the beginning 
of a detailed analysis of mathematical and numerical methods used in current 
astrophysics and relativistic cosmology. 

This treatise is based on our more than 50 works in Special and General Relativity 
and related topics that we published mostly in prestigious international journals such 
as Journal of Computational and Applied Mathematics, Gravitation and Cosmol- 
ogy, Communications in Computational Physics, Astrophysics and Space Science, 
Mathematics and Computers in Simulation, International Journal of Astronomy and 
Astrophysics, New Astronomy, Neural Network World, Astronomische Nachrichten, 
and the Czech journal Pokroky (=Advances) matematiky, fyziky a astronomie (see 
e.g. dm1 .cz). Most of our results were reported at many international conferences 
(see e.g. [162, 175, 177-180, 195]) and also at our regular seminars Current 
Problems in Numerical Analysis and Cosmological Seminar which take place at the 
Institute of Mathematics of the Czech Academy of Sciences in Prague [387, 398]. 

Prague stood at the birth of two revolutions in building the theory of gravity: 


1. At the beginning of the seventeenth century in Prague, Johannes Kepler for- 
mulated his first and second laws of planetary motion with the help of Tycho 
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Brahe’s observational data. This discovery played a key role in Newton’s theory 
of gravitation. 

2. In the period 1911-1912, Albert Einstein received his first full professorship at 
Charles-Ferdinand University in Prague. His colleague Georg Pick worked on 
non-Euclidean geometries and taught Einstein the foundations of tensor calculus. 
In Prague, Einstein found the calm atmosphere necessary to formulate the basic 
ideas of his General Theory of Relativity. Here he wrote several important papers 
on this topic. 


There are many books on General Relativity and cosmology. Let us mention, 
e.g., [107, 184, 249, 270, 272, 332, 337, 369, 371]. However, our book is written 
in a different style. Most of the important results are formulated in the form of 
mathematical theorems with precise assumptions and statements. Our book also 
contains some nonstandard topics such as: the violation of the principle of causality 
in Newtonian mechanics, critical mathematical and numerical analysis of Mercury’s 
perihelion shift, and the large computational complexity of Einstein’s equations 
which renders them factually inapplicable for the classical two-body problem. 
Further topics include the nonuniqueness of the notion universe, topology of the 
universe, various descriptions of a hypersphere, regular tessellations of hyperbolic 
spaces, flight around the universe, local Hubble expansion of the universe, neglected 
gravitational redshift in detection of gravitational waves, possible distribution of 
mass inside a black hole. We would also like to dispel some myths appearing in the 
theory of relativity and contemporary cosmology. 

Our book is intended for a general audience—especially for those who can appre- 
ciate the beauty of both mathematics and cosmology, who are open to new ideas, 
and who want to break out of the rut. We only assume that the reader is familiar 
with the basic rules of arithmetic and has no problem with adjustments of algebraic 
formulas. Only very rarely it is necessary to understand some deep relationships 
from linear algebra or calculus. Most chapters can be read independently from one 
another. Some parts are quite simple, others more complicated. If some part is too 
difficult, there is no problem in skipping it. 

For inspiration, there are also several links to websites, although we are well 
aware that they are not subjected to any review and change quite frequently. Newly 
defined terms are highlighted in italics in the text for the convenience of the reader. 
They can also be found in the Index. 

In order to read the individual chapters, it is not necessary that the reader 
understands all assertions. Mathematicians formulate their ideas in the form of 
theorems that contain only what is relevant in the problem in question. To avoid 
any discussion whether a particular statement is true or not, we provide proofs of 
most statements so that everyone can verify their validity. For more complicated 
proofs, we only give suitable references to the corresponding literature. 

In a number of discussions, many researchers helped us to improve the content 
of this book, in particular, Jan Brandts, Jan Chleboun, Caroline Griffis, Jiri 
Grygar, Vesselin Gueorguiev, Ivan Hlavacéek, Kurt Koltko, Pavel Krizek, Monika 
Lanzend6rferova, FrantiSek Lomoz, André Maeder, Jan MarSak, Jaroslav Mlynek, 
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Vladimir Novotny, Lubo§ Pick, Mirko Rokyta, Karel Segeth, Antonin Slavik, 
Vaclav Vavrycuk, TomaS Vejchodsky, Emil Vitaések, and Vladimir Wagner. We 
really appreciate their help and they deserve our great thanks. Furthermore, we are 
deeply grateful to Attila Mészaros for his excellent regular lectures on cosmology 
at Charles University in Prague, and to Hana Bilkov4 and Eva Ritterova for 
their technical assistance in the final preparation of the manuscript. Preliminary 
I4TRX typesetting was prepared by Michal Kfizek. 

Finally, we are profoundly indebted to Elena Griniari, Francesca Ferrari, and 
Harini Rajesh from Springer-Verlag for their helpful cooperation in the preparation 
of this book. Our great thanks go to the Springer Publishing House for its care in 
the graphical design of the book and to the referees for invaluable suggestions. We 
are also grateful to our families for patience and understanding. 

This book was supported by the Czech Academy of Sciences (RVO 67985840). 
Some of our results contained in Chaps. 3-8 were developed in part by our work 
supported by Grant No. 23-06159S of the Grant Agency of the Czech Republic. 


Prague 1, Czech Republic Michal Kiizek 
Washington, DC, USA Lawrence Somer 
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Set of natural numbers 

n-dimensional Euclidean space 
Three-dimensional sphere (hypersphere) 
Three-dimensional hyperbolic pseudosphere 
Curvature index from {—1, 0, 1} 

Set of complex numbers 

Open interval 

Closed interval 

Absolute value 

Logarithm to the base b 

Natural logarithm 

Euler number 2.718 281 828... 

For all 

There exist(s) 

Diagonal matrix (tensor) 

Determinant 

Volume 

Transposition 


Landau symbol: f(a@) = O(g(qa)) if there exists C > 0 such 


that | f(@)| < C|g(a@)| fora > 0 ora > oo 
Empty set 

Ludolph number 3.141592 653... 

Product 

Sum 

Intersection 

Union 

Set subtraction 

Subset 

Is element of 

Is not element of 

Set of all elements x from A which possess property P (x) 
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Glossary of Symbols 


Mapping (function) f from the set A to the set B 
Function which assigns value f(x) to x 
Implication 

Assignment 

Expansion function 

Time derivative of a = a(t) 

Halmos symbol 

Sun 

Radius of the Sun 695 700 km 

Mass of the Sun 1.988547 - 10°° kg 

Mass of the proton 1.67- 10-7’ kg 

Redshift 

Speed of light in vacuum 299,792,458 m/s 
Gravitational constant 6.674 - 10~!! m?kg~!s~? 
Cosmological constant ~ 107>* m7? 
Stefan-Boltzmann constant 5.669 - 10~® Wm~*K~+ 
Planck constant 6.626070 - 10~*4 Js 
Astronomical unit 149,597,870,700 m 

Parsec 3.262 ly = 206 265 au = 3.086- 10! m 
Hubble-Lemaitre constant ~ 70 km/(s Mpc) 
Hubble parameter 

Sidereal year 365.256 36 days = 31558 149.54s + 2 - 10's 
Light-year 63,242 au= 9.46 - 10!5 m 

Century 

Electronvolt = 1.602 - 107!9 J 
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Mathematical Modeling nn 


1.1. Introduction 


Cosmology is a branch of astronomy that deals with the greatest spatial and 
temporal distances and the questions of the origin of the universe as a whole. 
However, the current standard cosmological model leads to a wide range of serious 
paradoxes (or even catastrophes—see Index) and does not give acceptable results 
(see e.g. [24, 25, 79, 184, 205-207, 217, 226, 360]). Therefore, in this book we will 
analyze mathematical models used in cosmology and investigate their theoretical 
background. 


The main problems of the standard ACDM (i.e. Lambda-Cold Dark Matter) 


cosmological model, which should not be ignored, can be summarized as follows: 


10. 


11. 


. The problem of the existence of dark matter (see Sect. 8.1). 

. The problem of the existence of dark energy (see Sect. 9.1). 

. The problem of isotropy of the universe (see Remark 6.1). 

. The problem of homogeneity of the universe (see Remark 6.2). 


The flatness problem (see Remark 6.19). 


. The problem of the existence of the cosmological constant (see Remark 3.15). 
. The problem of the accelerated expansion of the universe and the energy non- 


conservation problem (see Remark 3.16 and Sect. 10.6). 


. The problem with observed values of the cosmological parameter og (see 


Remark 6.22). 


. The problem that the standard cosmological model admits a division by 


zero and that cosmological parameters have a strange behavior in time (see 
Sect. 6.7). 

The problem with inconsistency of different values of the Hubble-Lemaitre 
constant obtained by different methods (see Remark 6.23). 

The problem of hierarchical structures (see Remark 1.12). 
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12. The problem of the existence of stars that are older than the universe according 
to the standard cosmological model (see Remark 7.28). 

13. The problem of setting up accurate initial conditions (see Remark 10.2). 

14. The problem of the existence of an infinite universe (see Remarks 6.27 
and 6.28). 

15. The inflation problem (see Remark 7.7). 

16. The problem of the existence of giant black holes in the early universe (see 
Sect. 3.6). 

17. The horizon problem (see [122]). 

18. The problem of the large angular momnetum and velocities of all spiral galaxies 
(see Remark 9.7). 

19. The problem of a 120-order-of-magnitude discrepancy between the density of 
fluctuations in a vacuum, which should have gravitational effects, and theoreti- 
cally derived density of dark energy (see [11] for the vacuum catastrophe). 

20. The Big Bang problem itself (see Remarks 6.8 and 6.21). 


Some of these problems are overlapping, but their number is growing. Most 
mainstream cosmologists do not mind that these problems exist and still defend the 
standard cosmological model. Their frequently used argument: “We have nothing 
better” does not hold up. 

Why is this so? The main reason is that physical reality is identified with very 
simple mathematical models (see e.g. (7.21) below) ignoring the modeling error. 

The second reason is that people like sensationalism: Our universe consists of 
25 % of some invisible mysterious dark matter and 70 % of even more mysterious 
dark energy, and nobody knows what it is, etc. However, these mysterious entities 
could be mostly the above-mentioned modeling errors. 

The third reason is that cosmology is developing too rapidly and spontaneously. 
It usually does not rigorously use exact mathematical procedures and methods. 

From 1-20 it is clear that the standard cosmological model, which is a direct 
mathematical consequence of Einstein’s equations, does not approximate the physi- 
cal universe well [200]. In our book we will investigate mathematically some of the 
above-mentioned problems and puzzles. 

Mathematical models of physical phenomena, in general, are typically described 
by algebraic equations, ordinary or partial differential equations, integro-differential 
or integral equations, systems of these equations, variational inequalities, systems 
of differential-algebraic equations, etc. Their exact analytical solution is usually not 
known and there is no universal method capable of solving the above-mentioned 
models. Therefore, each class of problems has to be investigated individually. This 
consists of several steps including, as a rule, the proof of the existence (and, 
if applicable, also uniqueness) of the true and approximate solution, numerical 
solution of approximate problems and its stability, the treatment of convergence 
questions, continuous dependence of solutions on data, etc. 

We should have in mind several important features in the mathematical modeling 
of physical problems. First, one should realize that no physical relation (except for 
definitions) holds entirely exactly, since the universe is composed from elementary 
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particles and their behavior can be predicted only with some probability (due to the 
Heisenberg uncertainty principle and the Born probabilistic interpretation of wave 
functions). That is why each physical equation can only approximately describe 
reality. Any model problem thus includes several simplifications. The most common 
simplification is a “continuous approach’, where discrete objects (such as molecules 
or atoms) of the universe are replaced by continuous ones. This idea allows us to 
employ the theory of differential or integral equations. 

Second, we must never overestimate mathematics itself. Do not forget that 
mathematics is only a “game” (even though this game has precisely defined notions 
and rules). Mathematical models do exist only in our minds. They are invented 
mainly for better understanding of the universe. 

Third, validation and verification of problems of mathematical physics and 
their computer implementation is a very important part of numerical analysis. We 
always encounter two basic types of errors: modeling error and numerical errors 
such as discretization errors, computational errors (iteration errors, algebraic errors, 
rounding errors) and also undiscovered programing bugs. Validation tries to estimate 
the modeling error and to answer the question: 


Do we solve the correct equations? 


On the other hand, verification tries to quantify the numerical errors and to answer 
the question: 


Do we solve the equations correctly? 


We will return to these questions in Sects. 1.5 and 1.6. Now we will focus on 
mathematical modeling in celestial mechanics. 


1.2 Kepler’s Laws 


Already since the seventh century AD Chinese astronomers were assembling 
detailed tables of positions of the planets, a thousand years before Johannes Kepler 
(1571-1630). They even knew that the trajectories of the planets form peculiar loops 
on the celestial sphere. However, they did not discover Kepler’s laws, since they did 
not have a good geometric model of the functioning of the Solar system and their 
measurements of positions of planets were one order of magnitude less accurate than 
the measurements of the Danish astronomer Tycho Brahe (1546-1601). It must also 
be taken into account that Kepler knew the Copernican heliocentric model of the 
Solar system very well and that Kepler’s ingenuity certainly played a big role as 
well. 

Johannes Kepler analyzed the very accurate data of Tycho Brahe on the 
movements of the planets during his stay in Prague from 1600 to 1612. He found 
that the planets follow elliptic orbits and that the areal velocity of the radius vector 
that originates at the Sun is constant for each planet. In this way the first and second 
laws of Kepler were discovered. They were first published in 1609 in one of Kepler’s 
most important works, Astronomia nova. At present they are formulated as follows: 
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Fig. 1.1 Illustration of 
Kepler’s method for the 
determination of the elliptic 
orbit of Mars, which after one 
Martian year projects to a 
different part of the sky. Mars 


is located at the intersection > Mats 
of the two directions 


Fig. 1.2 Schematic 
illustration of Kepler’s second 
law stating that the areal 
velocity is constant 


Kepler’s First Law The orbit of each planet is elliptic with the Sun at one of its 
foci. 


Kepler’s Second Law The radius vector connecting a planet and the Sun sweeps 
out equal areas of the ellipse during equal intervals of time. 


Now we recall how Kepler discovered these laws. He knew that the orbital period 
of Mars is 687 days, therefore, after this time period Mars returns to the same place, 
while the Earth has orbited the Sun almost twice. This actually defines two different 
directions in which Mars is projected on the celestial sphere and Kepler could 
therefore determine its location in the orbital plane (see Fig. 1.1). Repeating this 
procedure for different time instants, with the help of coordinates of Mars measured 
by Brahe, Kepler could draw the entire orbit of Mars and find that its trajectory is 
elliptic (see Kepler’s first law). 

When Kepler ascribed relevant time data to the respective positions of Mars, he 
discovered the second law (see Fig. 1.2). 

Let a > b stand for the lengths of the semiaxes of an elliptic orbit of the planet. 
For simplicity, we will also denote these semiaxes by the same symbols a and b. 
Here a is called the semimajor axis and b the semiminor axis provided a > b. 

The distance ¢ of the focus of the ellipse to its center is called the linear 
eccentricity and is defined by the relation (see Fig. 1.3) 


6 = Va? — b?. 
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Fig. 1.3 Keplerian trajectory 


Similarly 
eee (1.1) 
a 


is called the numerical or orbital eccentricity. For simplicity, we will call this 
dimensionless number only the eccentricity. We see that 


b=a/l=e. (1.2) 


The case e = 0 corresponds to a circle. The eccentricity of an ellipse is always less 
than one. 

Kepler was actually very lucky that he concentrated his attention mainly to Mars, 
because its orbit has a relatively large eccentricity e = 0.0935 ~ 0.1 (see Fig. 1.1). 
A consequence of Kepler’s first and second law is (see [155]): 


Kepler’s Third Law The square of the orbital period of a planet is proportional to 
the cube of the semimajor axis of its orbit. 


Kepler discovered the third law due to his consistent and systematic work. 
Essentially, he found it empirically, when he left Prague. Kepler’s third law can 
be formulated as 


T* =Ca?. (1.3) 


Here, T is the orbital period of the planets and C > 0 is a constant. At that time the 
multiplication of numbers was accomplished by means of the sum of logarithms and 
after using the inverse procedure, Kepler obtained the product. He noticed a simple 
dependence 


21logi9 T — 3 logjy a = const. 


between the decimal logarithms of the measured value of the orbital period T and 
the length of the semimajor axis a. It was then a short step to the formulation of 
the third “harmonic law”, which Kepler published in 1619 in his work Harmonices 
mundi libri V. In 1621, Kepler found that his new law also applies to the four large 
moons of Jupiter. 
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Remark 1.1 From (1.3) we may calculate e.g. distances of all the planets from the 
Sun knowing their orbital periods and the Earth-Sun distance. 


Now we will introduce some further useful relations for elliptic planetary orbits. 
Assume that the mass of a planet is negligible relative to the mass of the Sun. The 
point of its furthest (closest) approach to the Sun is called the aphelion (perihelion). 
If; and r are the respective distances of the planet at aphelion and perihelion from 
the focus F’, where the Sun is located, then r; = a+eé andr = a — « (see Fig. 1.3). 

From the relations 


2a=r+n, =r —12, (1.4) 


and b = Va? — €? we see that the length of the semimajor axis a is equal to the 
arithmetic mean of distances r; and ro, 1.e., 


mtr 


7 (1.5) 


while the length of the semiminor axis b is equal to their geometric mean, i.e. 


pa l(OS") (C8 av 16 


Denote by v; and v2 the respective velocities of a planet at aphelion and 
perihelion. By the law of conservation of angular momentum (mrv =const.) it 
follows that 


ry) vi, = 1202. 
From this, (1.4), and (1.1) we obtain 


v2 r| ate l+e 
v™oom a-e 1l-e 


(1.7) 


For a fixed e, the ratio v/v, is thus a constant independent of the size of the elliptic 
orbit. For the eccentricity e = 0.0935 of Mars’ orbit we obtain a fairly high ratio 
v2/v, = 1.206, which actually helped Kepler to discover his second law. For the 
eccentricity e = 0.2056 of Mercury’s orbit (see Table 1.1) we get a much higher 
ratio v2/v, > 1.2/0.8 = 1.5. Moreover, according to (1.7), we find that 


vw-v or r2 
U2 + VY retre’ 


(1.8) 


where r} = a(1 + e) andr2 = a(1 — e). 
Each planet is generally moving along its orbit nonuniformly. It is therefore 
important to be able to determine its coordinates at a given time instant in the plane 
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Table 1.1 The symbol 7 denotes the ith planet, m; its mass in kg, a; is the semimajor axis of its 
orbit in meters, 7; is the orbital period in years, v; is the mean orbital velocity of the planet around 
the Sun in km/s, R; is the mean radius of the planet in km, and e; is the orbital eccentricity 


i Planet mj; /1074 ai/ 10° T; Vj R; ej 

1 Mercury 0.33022 57.9 0.241 47.9 2440 0.2056 
2 Venus 4.8676 108.2 0.615 35.0 6052 0.0068 
3 Earth 5.97219 149.6 1 29.8 6371 0.0167 
4 Mars 0.64185 229.97 1.881 24.0 3390 0.0935 
5 Jupiter 1898.6 7718.4 11.861 13.1 69911 0.0487 
6 Saturn 568.46 1427.0 29.457 9.6 58232 0.0520 
7 Uranus 86.81 2869.6 87.011 6.8 25362 0.0472 
8 Neptune 102.43 4496.6 164.79 5.4 24624 0.0086 


of its elliptic path, and those are what is determined by the Kepler equation, see 
e.g. [159, 184] for details. In this case, the sum of kinetic and potential energy is 
constant. 


1.3. Newton’s Gravitational Law 


Newton’s law of universal gravitation played a crucial role in the development of 
physics. It significantly contributed to understanding the structure and evolution 
of the universe. Sometimes it is called the discovery of the last millennium, as 
will be indeed seen from the following paragraphs. It was first formulated by Sir 
Isaac Newton (1642-1726) in his treatise Principia from 1687. In particular, he was 
inspired by Kepler’s third law. 

According to Newton’s law of gravitation, the size of the gravitational force 
between two point masses is equal to 


F=G (1.9) 


re? 


where m and M are their masses, r is their distance, and 
G = 6.674- 107|! m3kg7!s~? 


is the gravitational constant (cf. Remark 10.1). 


Remark 1.2 If m is the mass of a planet and M = Mo the mass of the Sun, then for 
a circular planetary orbit of radius r = a by Newton’s law of action and reaction, 
the gravitational force (1.9) is equal to the centripetal force, i.e., 


mM mv 
G— = —, 
2; 
a a 
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where v is the orbital velocity of the planet. From this for v = 27a/T we obtain as 
a result Kepler’s third law in the form 


ar OM = 0) (1.10) 
T2” 4n2 ©” , 


where it is implicitly assumed thatm < M. 


Remark 1.3 Observe that relation (1.10) connects three important quantities with 
the physical dimensions of length, mass, and time. In the system of units SI, they 
are expressed in 


m, kg, s. 


Knowing two of them, we can calculate the third one. Therein lies the beauty of 
Kepler’s third law. 


In some textbooks it is inaccurately stated that F in (1.9) is the size of 
gravitational force between two bodies. In doing so, the reader will not know how 
to define precisely their distance. For example, what is the distance r between a 
homogeneous tiny ring (almost a circle) and a point mass at its center? That is: © 


1. If the distance r is equal to the radius of the ring, we do not get the correct answer 
by (1.9), since the resulting total force (and thus also its size) is zero. 

2. However, for F = 0 relation (1.9) yields r = 00, which certainly has nothing to 
do with reality. 

3. If r is the distance between the centers of gravity of both bodies, then we 
surprisingly obtain F = oo, because the denominator in (1.9) is r = 0. Again, 
we get into trouble. 


Thus we see that a mechanical use of the law of gravity can lead to unexpected 
paradoxes. In what follows we shall, therefore, often restrict ourselves only to 
idealized point masses, which in the physical world do not exist. However, 
relation (1.9) remains unchanged if instead of point masses we consider balls with 
special density distributions as follows from the next theorem. 


Theorem 1.1 (Newton’s First Theorem) /f the density distribution of a ball of 
mass M is spherically symmetric, then the size of the force between the ball and a 
point mass m that lies outside the interior of the ball, is given by (1.9), where r is 
the distance between the point and the center of the ball. 


The proof is based on the specific shape of the gravitational potential of 
mass uniformly distributed on a spherical surface. The resulting equation (1.9) is 
then obtained by integration, see e.g. [15, p. 149]. Theorem 1.1 can evidently be 
generalized to the interaction between two balls with spherically symmetric mass 
density distribution (see Fig. 1.4). 
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© : 
M m 


Fig. 1.4 Illustration of Newton’s first theorem for two spherically symmetric bodies 


Fig. 1.5 Roche lobe of two closely orbiting bodies 


Fig. 1.6 The force between two bodies cannot generally be substituted by the force between two 
point masses located at their centers of gravity 


Remark 1.4 Celestial bodies with diameter of over ~1000 km are automatically 
formed by gravity in an approximate spherical shape and their density distribution 
also becomes almost spherically symmetric. This process is called gravitational 
differentiation. However, two large closely orbiting bodies are not spherically 
symmetric due to tidal bulges. In Fig. 1.5 we see the intersection of contour lines 
of the gravitational potential at the Lagrangian point L; of the corresponding Roche 
lobe (cf. [28, p. 350]). 


Example 1.1 Yn general, bodies that do not have a spherically symmetric mass 
density distribution cannot be replaced by point masses at their centers of gravity 
(e.g. the elongated asteroid Ida which is orbited by the satellite Dactyl). To see this, 
it is enough to consider how a dumbbell-shaped body with mass M = M, + M2 
acts on the second body, which is a homogeneous ball of mass m. The mass of the 
straight middle part of the dumbbell is neglected. Set Mj = M2 = m = | kg. The 
scale on the horizontal axis in Fig. 1.6 is indicated in meters. Then the magnitude of 
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the total force between the two bodies is equal to 


However, if we concentrate the mass of the dumbbell at its center of gravity at 0, 
then the resulting force F will be essentially different from F, i.e. 


is M M 
Fs Gin’ it 2) GC 


= 2 and 2F <F. 


The quadratic nonlinearity r* in the denominator of Eq. (1.9) thus causes that the 
force F is more than two times bigger than the force F' corresponding to the mass 
of the first body concentrated at its center of gravity. 


Example 1.2 Is it possible to measure the mean mass density of the Sun using a 
protractor? To solve this seemingly absurd problem assume for simplicity that the 
Earth’s orbit is circular. From (1.9), Newton’s second and third law (of action and 
reaction), we get as in Remark 1.2 that 


M 2 
Ea cy (1.11) 


r2 r 


where Me is the mass of the Sun, m is the mass of the Earth, r is their mutual 
distance, and v is the speed of the Earth. By a “careful” use of the protractor, we find 
that the viewing angle of the Sun is approximately 6 = 32’. Hence, Ro = r sin 56 
is the radius of the Sun, see Fig. 1.7. Clearly, 


2ar (1.12) 
v= — ; 
T ’ 


where T = 31,558,149.54 s = 365.25636 days is the Earth’s orbital period (the 
sidereal year). Denoting by V = 30 Re the volume of the Sun and substituting Mo 


je 


Fig. 1.7 The mean mass density of the Sun can be determined from its viewing angle 5 and the 
Earth’s orbital period (see (1.13)) 


1.4 The N-Body Problem 11 


Fig. 1.8 Illustration of 
Newton’s Second Theorem 
1.2 in a cross-section of the 
spherical layer. The force 
acting on a point mass inside 
the cavity marked by + is 
zero for a spherically 
symmetric mass density 
distribution. The gravitational 
potential inside the cavity is 
constant 


from (1.11), we get by (1.12) that the mean mass density is 


M. . 2nr)? - 3 
a= OUT _ call 2 _ 1 = 1409 kg/m}, 
V GV 1°G. 3n(rsin36)> = T?G sin? 55 


(1.13) 


i.e., itis about 40 % higher than the density of water. Moreover, using Kepler’s third 
law (1.10), we find that the mass of the Sun is about 


Mo = 2- 10°? kg. 


Another important theorem for a spherical layer states (see Fig. 1.8): 


Theorem 1.2 (Newton’s Second Theorem = Shell Theorem) A spherical layer 
with spherically symmetric mass density distribution exerts no force on a point mass 
located inside. 


Remark 1.5 Newtonian celestial mechanics is very elegant and perfectly models 
reality on short time scales. It is the simplest theory of gravitation. Claudius Ptolemy 
(ca. 100-170 AD) described motion of planets by very complicated epicycles. On the 
other hand, the General Theory of Relativity (and its extensions) is too complicated 
to describe the evolution of the Solar system as we shall see in Sect. 5.3. 


1.4 The N-Body Problem 


For an integer N > 2 consider an isolated system of N point-mass bodies that 
mutually interact only gravitationally. Let r; = r;(t), i € {1,..., N}, be the so- 
called radius-vector in Euclidean space E? describing the position of the ith body 
with mass m; in time tf > 0. According to Newton’s second law of motion, the force 
between them is linearly proportional to the acceleration. By (1.9) the classical N- 
body problem is described (see [1, 156, 184, 234]) by the following nonlinear system 


12 1 Mathematical Modeling 


of ordinary differential equations for the unknown trajectories 7; 


mij —Fi 
Aaoy™ = a eats (1.14) 


por 
i 


where 7; denotes the second time derivative of r;. However, this system does not 
have a unique solution. Therefore, at time t = 0 we impose initial conditions on 
positions p; of all bodies and their velocities v;, 


r(0) = pi, *(0)=v; fori=1,...,N (1.15) 


which, of course, have an essential influence on the resulting motion of the bodies. 
Note that Eq. (1.14) are called autonomous, since their right-hand side does not 
explicitly depend on time. Hence, without loss of generality, we can take the initial 
conditions at t = 0. 

From the theory of ordinary differential equations, we know that the solution of 
problem (1.14) and (1.15) exists uniquely on an arbitrarily long time interval, where 
all mutual distances |r; — r;| fori < j are nonzero, i.e., the bodies do not collide. 
Note that the probability that two or more point masses collide within a fixed time 
interval is equal to zero, even though this case may theoretically occur. Although 
there are numerous collisions in the physical world, we should not forget that in 
the classical multiple body problem only point masses are considered, while actual 
bodies always have positive sizes. 

The N-body problem has many real-world technical applications. As an example 
let us recall the complex gravitational ping-pong of the journey of Voyager 2 to 
Jupiter, Saturn, Uranus, and Neptune, when they all were on the same side of the 
Sun. The idea of accelerating probes by planetary gravitational fields was developed 
around 1960 during numerical simulations of the multiple body problem. It is a 
concept of the American mathematician Michael Andrew Minovitch. The actual 
trajectory is obtained by the numerical solution of a system of differential equa- 
tions (1.14). Each planet greatly accelerated the probe by means of its gravitational 
field and also changed its direction with respect to the Sun. The probe actually fell 
into the gravitational potential well of each planet. Sometimes this orbit is referred 
as a gravitational slingshot. Higher kinetic energy is gained by the probe at the 
expense of the total energy of the planet. The speed of the probe after a close flyby 
about a planet decreases, of course. 


Remark 1.6 Newton’s gravitational law also enabled us to discover e.g. the planet 
Neptune and white dwarf Sirius B, to land six Apollo missions on the Moon, to 
obtain unique photos of planets and their moons, to calculate trajectories of asteroids 
threatening the Earth, to launch meteorological and telecommunication satellites 
(including GPS), to place telescopes in space, and to establish various physical 
parameters in the Solar system (cf. Example 1.2) and distant universe. 
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Remark 1.7 System (1.14) does not satisfy the well-known Carathéodory sufficient 
conditions for the existence of the solution, since the right-hand side of (1.14) is not 
continuous when r; — r;. Despite this, the solution exists locally as long as the 
point-bodies do not collide. The system (1.14) is nonlinear and has a large number 
of unrealistic solutions. For instance, a simple two-body problem on one line admits 
arbitrarily large (superluminal) velocities, see [184, p. 69]. In [313] five bodies are 
considered such that all of them reach infinity in a finite time for appropriate initial 
conditions. The Newtonian theory of gravitation thus may describe completely 
absurd situations. Papers [65, 252] present also a solution of (1.14) when three 
equally massive particles orbit along a trajectory that looks like OO which has not 
yet been observed in the universe. In this special case, the total orbital momentum 
is zero and the trajectory is stable with respect to small long-term perturbations. 


Remark 1.8 The system of ordinary differential equations (1.14) is sometimes also 
used to simulate the evolution of galaxies or even galaxy clusters when galaxies are 
replaced by point masses. Note that galaxies have huge speeds reaching up to a few 
percent of the speed of light. However, system (1.14) does not describe relativistic 
phenomena nor effects of tides, rotation of galaxies, the finite speed of propagation 
of the gravitational interaction, and so on. Hence, it is only an approximate model. 
The N-body problem should not be applied to strong gravitational fields either. 


1.5 General Computational Scheme 


The general computational scheme for solving problems of mathematical physics 
is sketched in Fig. 1.9. Since no equation describes physical reality absolutely 
exactly, we always make a modeling error eg. The true solution is usually searched 
in suitable function spaces whose dimension is infinite. However, for computer 
implementation we have to approximate them by finite dimensional problems, 
which yields a discretization error e,. This error may also include the error of 
numerical integration, error of approximation of the boundary of the examined 
region, interpolation of input data, etc. Finally, a computational error e2 arises 
during the computation of the discrete model itself on a computer. It contains, of 
course, rounding errors, but may include other errors (like iteration error, etc.). 

The errors eo, e€1, and e2 should be handled with care. They are generally not 
canceled. If just one is large, then the total error e3 = eg + e; + e2 is also large. 
Thus there is no point in trying to make another error very small. Therefore, it 
is computationally advantageous that all these errors are about the same size if 
possible. 


Remark 1.9 Some authors include a physical model between the physical reality 
and its mathematical model in Fig. 1.9. The corresponding error analysis is similar. 


Without any exception each equation of mathematical physics has some lim- 
itations on the size of the investigated objects. For instance, the standard heat 
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Discrete 
ey model ey 


Fig. 1.9 The modeling error ep is the difference between physical reality and its mathematical 
description. A discrete finite dimensional model differs from the mathematical model by the 
discretization error e;. Finally, in e2 are included computational errors (rounding errors, iteration 
errors, etc.) 


Numerical 
results 


Mathematical 
model 


Physical 
reality 


conduction equation (stationary or nonstationary) approximates very well the true 
temperature in solids of size about 1 meter, which can be verified by direct 
measurements [191]. However, in applying the heat equation on the atomic level in 
the cube with edge 10~!° m, we get nonsensical numbers, as well as in the cube with 
edge 10!° m, which would immediately collapse into a black hole (see Example 7.3 
below). Analogously, we should not apply Keplerian laws on scales of 107! m or 
the Schrédinger equation on objects that have the size of a cat. Similar arguments 
apply also for time scales. For instance, the standard predator-prey equation or the 
N-body problem on the interval of 10!° years do not describe the reality well. 
Therefore, in any calculation we have to take care of the modeling error which is 
small only under some restrictive space-time limits. 

In spite of that, when deriving the standard cosmological Friedmann equation, 
Einstein’s equations are applied to the entire universe. This is considered as 
a platitude and almost nobody deals with the question, whether it is justified 
to perform such a fearless cosmological extrapolation without any observational 
support, since general relativity has only been “tested” on much smaller scales like 
the Solar system (bending of light in the gravitational field of the Sun, measuring 
the curvature of spacetime near the rotating Earth by means of the Lense-Thirring 
precession effect, perihelion advance of Mercury’s orbit, Shapiro effect, etc.). Note 
that galaxies have a diameter of the order of 10!° astronomical units and the universe 
has at least five orders of magnitude larger diameter. 

Fritz Zwicky [385] and Vera Rubin [307] used too simple models and insufficient 
data involving diverse selection effects which led to the prediction of dark matter 
[183]. For instance, at the end of the last century astronomers believed that only 
3% of all stars are red dwarfs [29, p.93]. However, at present from the Gaia data 
we know that at least 76% of all stars are red dwarfs. Therefore, the proposed ratio 
27:5 between the amount of dark matter and baryonic matter is almost certainly 
overestimated. According to [196, p.442], the (normalized) Friedmann equation 
was derived under an incorrect cosmological extrapolation ignoring the modeling 
error. Of course, Alexander Friedmann when he published in 1922 his famous paper 
[108] had no idea about the size of our universe, since galaxies were discovered only 
in 1924 by Edwin Powell Hubble. Thus, it seems that cosmologists solve incorrectly 
extrapolated equations and then present their results very exactly (up to three/four 
significant digits). 
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1.6 Estimation of Modeling and Numerical Errors 


The difference between the true solution of a mathematical model and its numerical 
approximation is usually estimated using various a priori or a posteriori error esti- 
mates. This step is very important, since real-life problems are often nonlinear, large 
scale, ill-conditioned or unstable. They may have multiple solutions, singularities, 
etc. If we do not perform a reliable error analysis, we do not know, in fact, how close 
the numerical solution is to the exact one. First we will again deal with the N-body 
problem (1.14). 

Can we believe numerical simulations of the evolution of the Solar system based 
on (1.14) for billions of years in the past or future? 

The answer is NO for several reasons. Newtonian mechanics does not allow any 
delay given by the finite speed of gravitational interaction. System (1.14) is only a 
vector ordinary differential equation whose solution on the interval [0, 7] depends 
only on positions and velocities at the point tg = 0 and not on the history. An 
extremely small modeling error ¢ > 0 during one year may yield an error of order 
10° after 1 Gyr that may be quite large [161]. Also various nongravitational forces 
(Yarkovsky effect, YORP effect, electromagnetic forces, etc.) are not included 
in (1.14). The corresponding calculations over time intervals of several Gyr thus 
resemble computations of weather forecast for many months in advance. The reason 
is that system (1.14) is not Lyapunov stable in time, i.e., extremely small changes 
of initial conditions or other perturbations may cause very large changes in the final 
state provided the time interval is long enough. For N > 2 Isaac Newton stated: 

An exact solution exceeds, if I am not mistaken, the force of any human mind. 

Henri Poincaré knew that the N-body problem for N > 2 has an analytical 
solution only in a few special cases, but there is no explicit formula for its 
general solution (see Remark 4.8 below). Therefore, this problem has to be solved 
numerically by means of Runge-Kutta methods, multi-step methods, symplectic 
methods, etc., see e.g. [1, 184, 234]. For instance, we cannot calculate a reliable 
trajectory of our Moon in the Solar system on intervals longer than 10,000 years 
due to Lyapunov instability (see e.g. references in [340]). 

If an integration step h gives almost the same numerical results as h/2 the 
integration of the system (1.14) is usually stopped. However, this heuristic criterion 
does not need to work properly. Here we present another way how to check whether 
our numerical results are reliable. It is based on backward integration of (1.14) 
as sketched on Fig. 1.10. Let r = (r,..., ry) denote a vector function with 3N 
entries, i.e., each component r; of r is a three-dimensional vector function. Let 
f = f(@) stand for the right-hand side of (1.14) in a three-dimensional space. 
Then we can formulate the following theorem [162]. 


Theorem 1.3 Let r = r(t) be the unique solution of the system 


r=f(r) (1.16) 
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on the interval [0, T] with given initial conditions 
r(0)=ro and r(0)= v0. (1.17) 


Then the function s = s(t) defined by 


s(t) =r(T —-t) (1.18) 

solves the same system (1.16) with initial conditions s(0) = r(T), s(0) = —r(T) 
and we have 

s(T)=ro and s(T)=-—v9. (1.19) 


Proof According to (1.18) and (1.16), we obtain 
S(t) = (-F(T — 1) = rT —-t) = f(r(T —1)) = f(s). 


We see that s satisfies the same system (1.16) like r. For the final conditions 
by (1.18) and (1.17) we get relations (1.19), 


s(T) =r(0)=ro and s(T)=-r(T —T)=-r(0) = —vo. 


This ends the proof. Oo 


Example 1.3 Since the N-body problem is exponentially unstable, the above 
theorem can be applied follows. Denote by r* and s* a numerical solution of the 
system (1.14) with initial conditions (1.18) and 


s*(0)=r*(T) and s*(0)=-7*(T), 
respectively. If 5 > 0 is a given tolerance and 
[s*(T) — rol + [8*(T) + vol > 6, 


then we cannot claim that |r(7) —r*(T)|+|7(T) —7*(T)| < 6, where r is the exact 
solution, i.e., the numerical error of the original problem (1.14) and (1.18) would be 
also large, as schematically depicted in Fig. 1.10. 


Finally, let us point out that the previous Theorem 1.3 and computational strategy 
can be easily generalized to evolution partial differential equations. 

From (1.14) we observe that when two bodies are close to each other (rj ~ 
rj) which is an important case e.g. in space aeronautics, then we subtract in the 
denominator two numbers of almost the same size. This may lead to a catastrophic 
loss of accuracy (see e.g. [45, 162, 255] for several shocking examples). 

Moreover, numerical error generally grows exponentially with time. This fact has 
serious consequences. For instance, on April 13, 2029, the distance of the asteroid 
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Fig. 1.10 Application of 
Theorem 1.3 in numerical 
solution of the N body 
problem. The symbol r stands 
for the true solution, r* is the 
numerical solution, and s* is 
the control backward solution 


Apophis from the Earth will be about 30,000 km. However, for the next approach in 
2036 the distance cannot be reliably calculated, since the system (1.14) is unstable. 


Remark 1.10 There is a general belief in the current astrophysical community that 
Einstein’s equations best describe gravity [167]. However, their extreme complexity 
prevents one from verifying whether they model e.g. the Solar system better 
than Newtonian N-body simulations with N > 1. Hence, in this case Einstein’s 
equations are hardly testable in their general form, since their analytical solution is 
not known and they are, in fact, non-computable by present computer facilities, as 
we shall see in Sects. 5.1 and 5.3. Moreover, it will be obvious why we are unable 
to calculate reliably trajectories of planets by means of Einstein’s equations, e.g. in 
the Jupiter-Sun system. The reason is that the mass of Jupiter is not negligible with 
respect to the Sun’s mass. 

On the other hand, such trajectories can be calculated numerically very precisely 
on short time intervals by the Newtonian N-body simulations (e.g. with 8 planets) 
even though their analytical solution is not known. Thus the modeling error eo, 
which is the difference between observed trajectories and the analytical solution, 
is also not known. However, the modeling error eg (see Fig. 1.9) can be easily 
estimated by the triangle inequality 


leo| < lei] + le2| + les, 


where e3 is the total error which is the difference between observed trajectories 
and numerically calculated trajectories. The sizes of e; and e2 can be established 
by various a priori and a posteriori error estimates frequently used in numerical 
analysis. 


Remark 1.11 A light cone, i.e. a future cone, is the path that a flash of light, 
emanating from a single event (localized to a single point in space and a single 
moment in time) and traveling in all directions, would take through spacetime 
(cf. Fig. 1.11). Thus the N-body problem (1.14) does not satisfy the principle of 
causality: 


All real events necessarily have a cause. 


The reason is that in Newtonian formulation an infinite speed of gravitational 
interaction is assumed which is not physical. Hence, any information about changing 
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Fig. 1.11 Newton’s theory of gravitation does not satisfy the principle of causality, since it 
assumes that the speed of gravitational interaction is infinite. Thus if any body changes its position, 
then information about this occurrence immediately travels outside the corresponding light cone 
along the horizontal axis which represents the spatial coordinates (x, y, z), where two spatial 
dimensions are suppressed 


position gets immediately horizontally out the corresponding light cone. The 
modeling error of the speed of gravitational interaction is thus infinite. 


Remark 1.12 Newtonian celestial mechanics is designed in such a way that the 
laws of conservation of energy and angular momentum hold absolutely exactly. 
However, in Sects. 9.2, 9.3, and 10.4, etc., we shall see that this is not true for the 
physical universe. Moreover, on different scales we observe qualitatively completely 
different hierarchical structures (stars, planetary systems, globular clusters, spiral 
galaxies, galaxy clusters, galaxy filaments, cosmic web) that are stable for billions 
of years. This indicates that physical gravity is not scale invariant. 


Chapter 2 ®) 
Paradoxes in the Special Theory cree | 
of Relativity 


2.1 Short Introduction to the Special Theory of Relativity 


This chapter is based on our papers [168-171]. In Theorem 2.1 and Examples 2.2 
and 2.5 we show that a finite speed of light and the Doppler effect can produce more 
dominant and entirely opposite effects for relativistic speeds than those predicted 
by the Special Theory of Relativity (STR), in particular, the clock paradox, time 
dilation, and length contraction. This important fact is not emphasized in most of 
the books on STR. 

For instance, by STR a moving clock ticks slower than at rest. However, an 
observer will measure a higher frequency of an approaching clock than the same 
clock has at rest due to the Doppler effect. We also prove that under certain 
conditions an approaching bar on a photo may seem to have a larger length for a 
relativistic speed than at rest, even though by STR lengths of moving objects are 
contracted. 

According to Newton’s first law of inertia, a body will remain at rest or in uniform 
motion in a straight line unless acted upon by an external force. This fundamental 
physical principle serves to introduce inertial systems in STR, see [107, p.211]. 
Consider a fixed coordinate system S with orthogonal axes x, y, z containing a 
fixed system of hypothetical synchronized clocks that define the time coordinate 
t € (—0oo, oo) of a uniformly flowing time. The coordinate system S' is called 
inertial if it obeys Newton’s first law of motion. 

Let S’ be another coordinate system with orthogonal axes x’, y’, z’ which are 
for simplicity parallel with x, y, z and have the same scale at rest [384]. The time 
t’ € (—oo, oo) in S” is introduced similarly using a fixed system of synchronized 
clocks in S’ having also the same time scale at rest. Let the origin of S’ move along 
the x axis at a constant speed v € (—c, c), where c is the speed of light in a vacuum, 
see Fig. 2.1. 
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Fig. 2.1 The inertial system 
S’ is moving with speed 

v € (—c, c) with respect to 
the system S 


Remark 2.1 The basic postulate of STR that the speed of light c has the same size 
in all inertial systems was verified experimentally on the Earth by the well-known 
Michelson experiments [244]. The inertial systems S and S’ move equally with each 
other. 


Lorentz’s transformation (see [227] and also [293]) is a fundamental tool of STR. 
The parameter defined by 


1 
Vv 
-2 


is called the Lorentz factor. Notice that its reciprocal value fulfills the equation for 


the unit circle 
v\2 1 \2 
Ce 
c Yu 


Points of the four-dimensional spacetime are called events. This space is 
sometimes referred as flat, since it is not curved. Unless otherwise stated, we will 
restrict ourselves to one pair of the above described inertial systems, where the 
encounter of the origins of S and S’ determines the beginning of time counting in 
the first and in the second inertial system, respectively, i.e., f = 0 in S and ¢’ = Oin 
S’. In this special case the Lorentz transformation has the form Ly : (x, y, Zz, f) 
(x’, y’, Zz, t’), 


x! = yy(x — vt), (2.2) 
! 

youy, 
! 

Z=Z, 


 =y(t- 53), (2.3) 
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where x, y,z,t € (—00o, 00) and the last equality expresses how to transform a 
uniformly flowing proper time during the transition from S to S’. Events which are 
simultaneous in S are given by the identity t = fo, where fo is a fixed constant. 

By (2.3) we see that the time ¢’ depends not only on ¢ but also on the position 
x, ie., t’ is not constant and thus the corresponding events do not have to be 
simultaneous in S’ for v 4 0. 


Remark 2.2 Albert Einstein uses the transformation (2.2) and (2.3) in his pioneering 
paper [89, p. 902] from 1905, but does not use the term inertial. He also does not 
cite Lorentz’s paper [227, p. 185] from 1892 nor does he mention Hendrik Lorentz 
himself. Einstein probably knew Lorentz’s work [227], since the titles of their two 
papers are very similar. Moreover, Lorentz was very famous after receiving the 
Nobel Prize in 1902. 


Let us emphasize that any two different events which are simultaneous in S are 
not causally connected. An observer can verify that they were really simultaneous, 
e.g., when he is in the middle and their future light cones just intersected. 

Notice that the right-hand sides of relations (2.2) and (2.3) are linear functions in 
variables x and t for any fixed v. Thus, for x = (ct, x, y, z) and x’ = (ct’, x’, y’, z’) 
the Lorentz transformation can be rewritten into the matrix form 


x’ = Lyx, 
where 

v 

Yo ——Yy 00 
‘ Cc 

Ly = ee vy 00 (2.4) 
0 0 10 
0 0 Ol 


is a block diagonal symmetric and positive definite matrix. Note that the physical 
dimension of all entries of the vectors x and x’ is one meter. 

The inverse matrix L,} has a similar form as L,, only the two minus signs 
in (2.4) have to be replaced by plus. Therefore, the Lorentz transformation L, is a 
one-to-one mapping from E* onto E* for v € (—c, c). 

Let us point out that in the limit case |v| = c, the matrix (2.4) becomes singular, 
since its first two rows are linearly dependent. It this case, the inverse of £, does not 
exist. Consequently, the Lorentz transformation should not be applied to the surface 
of the light cone. 
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The relation (2.3) is to be understood as the time which we would record at the 
moment when the two clocks in S and S” are closely passing each other at one 
single x-coordinate (e.g. at the origin). So we can compare only time data t and 
t’ of local clocks, because the concept of the present is relative. By definition, all 
clocks in each inertial system at rest show the same time in the entire infinite three- 
dimensional space (e.g. at the beginning and at the end of a motionless bar). Thus, 
when we are exactly in the middle between any two fixed clocks, they will show us 
the same time. 
Consider a fixed time interval 


/ / / 


where t; are space-independent coordinates in S’. For an arbitrary fixed point x in S 
we determine the corresponding f and ¢; from formulae (cf. (2.3)) 


tb vo(t ) ty volt ) 
= 2— 5X), = i=], 
ae ce ' 7 2 


and we set At = f2—1t,. From this we get the well-known time dilation (see e.g. [133, 
p. 430]) 


v v 
At! = y(n - at t+ a) = py At. (2.5) 


By (2.1) we see that Ar’ > Ar for any v ¥ 0 independently of the sign of v. The 
relation (2.5) actually expresses that the time, measured by a clock located at a fixed 
coordinate x’ in a moving system S’, runs slower than the time measured by a clock 
that is at rest with respect to S. Hence: 


The clock at rest is fastest. 


The time dilation is usually theoretically justified as follows: A photon launched 
from the origin of the systems S and S’ at time t = t’ = 0 in the z direction flies 
obliquely in S’ with speed c, see Fig. 2.2. Therefore, in terms of an observer in S’ 
this photon needs longer time to reach the plane z’ = z = | than in terms of an 
observer in the system S. 


Example 2.1 The experimental verification of time dilation can be demonstrated by 
means of particles called muons whose mean half-life time at rest is tT = 2.2-10~° s. 
From observations of cosmic rays we know that if muons move linearly at almost the 
speed of light, they will travel on average a much longer distance than ct = 660 m. 
However, it should be emphasized that in the inertial system associated with muons 
their decay will not slow down. 
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Fig. 2.2 Two (blue) vA Z i 
trajectories of the same 
photon in the inertial systems 1 1 
S and S’ ey 
Ss Ss! DvD 
ed 
0 x 0 x 


Remark 2.3 The time dilation is also verified by means of the transverse relativistic 
Doppler effect, see e.g. [38]. The transverse relativistic Doppler effect was first 
measured by Ives [138] already in 1938. In classical mechanics, this transverse 
effect does not occur, because it is given by time dilation (2.5) only. Note that the 
Hafele-Keating experiment [123] with two atomic clocks in airplanes and one on the 
Earth is not too credible, since none of the corresponding three systems was inertial. 


Remark 2.4 Olaf Rgmer (in Journal des Scavans, 1676) suggested an elegant 
method to measure the speed of light. When the Earth moved toward Jupiter, the 
time interval between successive eclipses of Jupiter’s moon Io became steadily 
shorter with respect to the terrestrial time. When the Earth moved away from Jupiter 
these eclipses became steadily longer, i.e., they were behind the expected values. 
Rgmer thus actually found a phenomenon, which was later named after Christian 
Doppler [184, p. 28]. 


The non-relativistic longitudinal Doppler effect [83] is described by the relation 


Cc 


fo= aE (2.6) 


c—v 


where f is the source frequency at rest, v is the speed of the source approaching 
an observer along the axis x, f, is the frequency measured by the observer, and c 
is the speed of signal. For relativistic speeds this relationship needs to be corrected 
by time dilation [107]. All physical processes including clock speed in S’ will run 
by (2.3) slower when observed from S. Thus by (2.6), the new relation will be of 
the form 


f= it (2.7) 
where c is the speed of light and 


ie ae: (2.8) 
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Fig. 2.3 The observer onthe [J " 
right is at rest with his clock. 
He will see a higher 
frequency of ticks on the 
clock on the left approaching | ——v 
him at relativistic speed. This , 
is because the Doppler effect 
is greater than time dilation 


corresponds to the lower frequency calculated from (2.5). By (2.7), (2.8), and (2.1) 
we obtain a relativistic Doppler relation for the frequency detected in S (see [89]), 


v\2 
: ee 
= 1 i Vo = (<) _ cru 
Cc Cc Cc 


From this and (2.8) we immediately get the following theorem. 


Theorem 2.1 For any v € (0,c) we have that fy/f’ > fv/f > 1. Moreover, 
hiolf ~ wasvu->c. 


Consequently, the Doppler effect manifests more than the time dilation itself, 
whenever the clock approaches the observer, see Fig.2.3. Hence, the higher the 
speed v, the greater the Doppler effect. Special relativity effects for large v are of 
higher order in v/c than those arising from the Doppler effect. 

It is therefore very important to distinguish consistently between reconstructions 
(calculations by means of the Lorentz transformation) and observations (measure- 
ments, detections, photos, videos, and recording in general). The notion “observer” 
in STR is somewhat confusing. It should not be a person who only applies 
relations (2.2) and (2.3). The observer performs real observations and measurements 
including all effects together as it is usually understood, i.e., the observer measures 
incoming frequencies. 


Example 2.2. Suppose that a clock is approaching the origin of S at relativistic speed 
v = 0.8c. Its proper time passes slower than on clocks fixed in the system S, since 
by (2.5) we have 


1 5 


> oe 3 
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and 
p38 
At = =At. 
3 
However, substituting v = 0.8c into (2.9) and (2.8), we find that 


pesy wd f=sT; (2.10) 


i.e., the observer at the origin of S will detect a 3x higher (blue-shifted) frequency 
fv than the same clock has at rest in the system S, and this frequency f, is even 
5x higher than the frequency f’ which the time dilation predicts (see (2.8)). This 
may seem to be paradoxical, since due to the Doppler effect the observer sees an 
opposite effect than STR claims. 

For a clock receding the origin by the speed (—v), the observer will detect 
by (2.9) a 3x lower (red-shifted) frequency than f. So there is a jump in these 
constant frequencies at the origin and only at this single point the observer can 
theoretically detect the proper frequency f’. So the Doppler effect plays an essential 
role. 


Remark 2.5 The observer usually does not have a possibility to measure directly 
the speed v of some distant object so that he could immediately use the Lorentz 
transformation. However, he can measure the observing frequency /f, (blue-shifted 
or red-shifted) of some characteristic spectral line of a certain chemical compound 
and establish the corresponding rest frequency /, i.e. in the corresponding rest 
frame. From this and (2.9) he can establish the speed v (or its radial component 
in a general case). Then he can determine the factor vy and find how significant 
are the corresponding relativistic effects (2.2)—-(2.3). 


2.3 The Lorentz Transformation Does not Allow 
Superluminal Velocities 


First we recall Einstein’s formula [89] for a relativistic addition of velocities, see 
also [270, Chapt. I.6]. 


Theorem 2.2 (Einstein) Let u € (—c,c) and w € (—c,c) be constant velocities 
of a point-like object in the system S and S', respectively, in the direction of the 
horizontal axis. Then 


pa (2.11) 


where v € (—c, c) is a constant speed of S' with respect to S. 
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Proof Velocities u and w are constant in S and S’, respectively. Therefore, 


_ dx _ dx’ 


u= — a w= —. 
dt dt’ 


(2.12) 
By (2.3) we get 


dt’ (1 Ss (1 >) 
dt =e c2 dt =e cy? 


where the difference in parenthesis is obviously positive. Using this equality, (2.12), 
and (2.2), we find that 


dx’ dx’ dt dx 1 uv\—! u—v 
ea ae ae mF “ri (1- 2) — 7 av" 


Cc 1- os 
From this it follows that 
ae uvw 
= 3 
Now it is enough to evaluate u and we obtain (2.11). oO 


For example, by Theorem 2.2 we see that for v = w = dc the speed u = ee 


is less than c. In the next theorem we prove that from (2.11) we can never get the 
speed of light or faster-than-light speed u, even if |v| and |w| are arbitrarily close 
toc. 

Let us recall that a group G is a set equipped with an associative binary operation 
o: Gx G — G and with the neutral element e such that for any g € G there exists 
exactly one inverse element g~! € G for which 

gog '=e=g log. 

The composition o of two Lorentz transformations £, and Ly given by rela- 

tions (2.2) and (2.3) for v, w € (—c, c) is defined as follows 


Ly = Ly Oo Ly; (2.13) 


where u satisfies Einstein’s formula (2.11). 


Theorem 2.3 Lorentz transformations L, for all v € (—c, c) defined by (2.2)-+(2.3) 
form an Abelian group. 


Proof If v, w € (—c, c) then obviously 


(4)(0+e)=0 amt (1-20-80 
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From this we find that 


Po 
and thus 
v+w 
-—c< 7 <c 
ee) 


Comparing with Einstein’s formula (2.11), we see that u € (—c, c), i.e., |u| is always 
less than c. 

Using (2.1) for v = 0, we find that yo = | and the corresponding transformation 
Lo is the identity, i.e. the neutral element. 

From (2.11) and (2.13) we immediately get that 


Ly o Ly = Lo = Ly o Ly; 
where £_, is the inverse transformation, i.e., x = y,(x/ + vt’), t = y,(t/ + vx! /c?). 


The composition o is commutative, since the special block diagonal matrices L, 
and L,, defined by (2.4) are commutative, i.e. 


Uv W 
Yu —-y~w 00 Vw ——Yy 00 
v Cc w Cc 
L,Ly = et yw 00 vee Yw O00 
0 0 10 0 0 10 
0 0 Ol 0 0 Ol 


UW vu+w 
YoYw + a Yo Yw A VoYw 00 


u+w vw 
=| —— hte Www + ZVYw 99] = LL, 
0 0 10 
0 0 01 


for all v, w € (—c, c). The associativity of the operation o is due to the fact that 
matrix multiplication is associative. Oo 


Remark 2.6 According to [248], astronomers observe superluminal velocities of 
jets produced by black holes in our Galaxy. However, in [158, 184] we present a 
detailed mathematical justification of this paradox. We show that this is only an 
illusion and that the actual velocity of jets is less than c. 


Remark 2.7 The classical geocentric model with motionless Earth in the center 
contradicts the Theory of Relativity. To see this consider the planet Neptune which 
is 4 light hours far from the Earth. Hence, the length of its orbit is 27 - 4 > 24 light 
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hours. So Neptune would need a superluminal speed to orbit the Earth in one day 
which is impossible. 


2.4 Length Contraction 


Lorentz’s length contraction is an immediate consequence of the Lorentz transfor- 
mation. On the horizontal axis x’ consider a fixed bar which is at rest in the system 
S’. Denote its length by 

Ax! = Xs _ Xi (2.14) 
where x; are fixed time independent coordinates of its ends in S’. For an arbitrary 


fixed time instant f in S we determine the corresponding x2 and x; from formulae 
(cf. (2.2)) 


X> = Yy(x2 — vt), xy = Yy(x1 — vt), 
and we set Ax = x2 — x1. Substituting this into (2.14), we get (cf. (2.5)) 
Ax! = yy(x2 — vt — xy + vt) = yy Ax. (2.15) 


Denoting £9 = Ax’ and £ = Ax, we get by (2.1) the well-known length contraction 
y2 

€=bo,/1—— 5. (2.16) 
c 


The bar at rest has the greatest length. 


Hence: 


Remark 2.8 For the time being there is no direct experimental evidence of length 
contraction (2.16). Anyway, we can verify it indirectly by means of muons 
mentioned in Example 2.1. In the system S associated with these muons at rest 
we will observe their usual mean half-life time t = 2.2- 107° s. However, in S’ 
connected with Earth’s atmosphere they will travel a longer distance than ct = 660 
m. This demonstrates the length contraction in S’ as seen from S. 


In 1959, Roger Penrose published a paper [274] (see also [277, 337, p.431]) 
describing why a quickly flying non-rotating ball in a photo should again look like 
a stationary ball. In the same year, his thoughts were elaborated in more detail by 
James Terrell [342] using light aberration. Here is a specific example showing the 
substantial effect of light aberration for relativistic speeds. 
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A 
Afj-£€D £0 x 


Fig. 2.4 The length of the legs of the larger (or smaller) right triangle is 1 and 0.75 (or 0.4 and 
0.3)m. The ratio between the lengths of sides of both the triangles is 5 : 4 : 3. Due to light 
aberration the flying bar from the left to the right at the speed 0.8c has the same length in the photo 
as the same bar at rest. A photon emitted to the camera C from the rear end of the moving bar will 
have the same x coordinate as this rear end in this special case 


Example 2.3 Consider a bar with length £9 = 1 m. Assume that it moves from the 
left to the right along the axis x by the constant speed v = 0.8c and that its front end 
just reached the origin of the coordinate system S. By (2.16) the bar as seen from S 
is shortened to 


£= lov 1 — 0.64 = 0.6 m, 


and thus the length of the straight line segment AD in Fig. 2.4 is |AD| = 0.4 m. We 
will photograph this bar from the axis z by a fixed nonrotating camera C which is 
placed at the distance 


d=0.75m (2.17) 


from the origin. Using the similarity of right triangles from Fig. 2.4, we find that 
|BD| = |AD|d/£o = 0.3 m. From this we have 


|AB| = 70.42 + 0.32 = 0.5 


measured in meters. The segment on the hypotenuse from B to the camera C has 
the same length in meters as d in (2.17), 


Vv 12 +.0.752 — |AB| = 1.25 —- 0.5 = d. (2.18) 


To avoid blurred photos, we assume that our idealized camera can take pictures 
within | picosecond. During this time period, the light will only fly 0.3 mm and a 
possible blurring will not play a significant role. For simplicity, we shall analyze 
only that photo, in which the front end of the bar just reached the coordinate origin 
of S. However, the rear end of the bar will be on the photo farther than @, since the 
light from the front end flies along a shorter distance d than the light from the rear 
end (see Fig. 2.4). That is why there will be recorded photons on the photo from the 
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rear end of the bar that were emitted earlier than those from the front end. During 
the time period, when the rear end of the bar moves from A to D, a photon pointing 
from A to the camera will travel the distance |A B|, since v/c = |AD|/|AB| = 0.8. 
Hence, thanks to light aberration and (2.18) the moving bar will have on the photo 
the same length as the fixed one meter long bar. 


Let us point out that a photon will travel the distance d from the origin to the 
camera C during the time period At = d/c. During this period, the bar will shift 
about vAt = 0.8d = 0.6 m, i.e., it will be placed entirely to the right of point 0. 


Example 2.4 Let again £9 = 1 m and v = 0.8c. Hence, € = 0.6 m. This time, 
however, we place the camera closer to the axis x, i.e. d < 0.75 m. We shall again 
analyze the image, where the right end of the bar is at the origin. The left end of the 
bar will shift from the point A = (—a, 0) to the point (—@, 0) during the time period 
At = (a — £)/v. During this period, a photon will travel the distance cAt from the 
point A to the camera C. From the relation aa+d? = ((a—- L)c/u+ d)? we can 
derive the following inverse formula 


(= - <) 4 2ae= — 22£ 
go cov U ) 
2(a — £) 
For instance, when a = 2 m we obtain d = 3 = 0.26... m. So if we place the 


camera on the axis z at a distance of 26 cm from the origin, the one meter flying 
bar will appear extended in the photo as two meters long. Similarly for v = 0.9 and 
d = 4cm we even get a = 4 m. The main cause for these surprising phenomena 
is that photons, which simultaneously passed through the lens, were not emitted 
simultaneously in S. 


For d > 0.75 m we shall see the bar in the photo as shorter than | meter. The 
same bar will also be shorter than @, if we photograph it such that its left end is at 
the origin. If it is placed exactly symmetrically with respect to the origin, its length 
on the photo will be just @, but nonlinearly deformed. 


Example 2.5 Now let us consider a 2 m long moving bar (€9 = 2 m) again with 
speed v = 0.8c, and let d = & = 0.26... mas in Example 2.4. When the bar is 
exactly symmetric with respect to the origin (cf. Fig. 2.5), the length of its left part 


Fig. 2.5 Left: the front end 
of the (blue) bar reaches the 
origin. Right: the rear end of 
the (red) bar reaches the 
origin, where 


|BC| =|CG| =d, 
|DE| = 22, and 
v|FG| =clEF| 


At—t D ~ 0 FE x 
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will be 2 m on the photo due to Example 2.4. Since its right part has positive length, 
we will see that the bar is longer than 2 m on the photo. 


Approaching objects show a blue shift (i.e. shortening the wave length). How- 
ever, due to aberration they may seem to be prolonged, which is paradoxical. On 
the other hand, receding objects that are manifested by red shift may seem to be 
shortened. Due to Examples 2.4 and 2.5 some pictures from the popularization book 
[112, Chapt. 1] are confusing. In the article [373], Weisskopf describes an apparent 
deformation of a quickly flying cube on a photo. 


2.5 The Twin Paradox 


In a series of publications [103, 112], [249, p. 167], [250, 253], ... the twin paradox 
(also called the clock paradox) is described by means of time dilation as follows: 

There is a set of twins, Adam and Bob. Adam stays on Earth, while Bob flies 
in a rocket e.g. to the star Sirius about 8 light years away with constant relativistic 
speed v. Bob’s time runs slower with respect to Adam’s clock due to time dilation, 
and his distance to Sirius is less than 8 light years due to length contraction. When 
Bob returns with velocity (—v) back on Earth, he finds out that his brother Adam is 
much older than him. 

Now let us look at the twin paradox in more detail. First we present several false 
arguments sometimes circulating among the general public and on internet. 

Sometimes it is incorrectly claimed that Bob will return as old as Adam. Bob 
seems to be at rest in his rocket and thus it is Adam who moves. Hence, Adam’s 
time runs more slowly than Bob’s time due to time dilation (2.5). This symmetry 
holds for all inertial systems. However, we have to realize that Bob has changed his 
inertial system when he was near Sirius, and thus this situation is not symmetric 
with respect to the two twins. Therefore, the above reasoning is not correct. 

To present another incorrect idea, we first present the following definition. The 
redshift z of an object resulting from its radial motions is defined by 


where A; and 1; are the observed wavelength and frequency, A9 and vo are the 
emitted wavelength and frequency in the rest frame, and A;v; = c fori = 0, 1. If 
z < 0, then z is called the blueshift. Note that the coordinate z appearing in (2.2) 
has a different meaning. However, no ambiguity can arise. 

Some people argue as follows: Adam first observes the redshift of the receding 
twin Bob due to the Doppler effect, i.e., Adam sees that Bob’s clock ticks slower. 
After the turnover at Sirius, Adam will watch the blueshift of Bob’s clock. These 
phenomena are thus balanced and the twins will be the same age. This reasoning is 
also not correct, since the time dilation (2.5) was not taken into account. 
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Fig. 2.6 The vertical axis ¢ t 
shows the time in years and 

the horizontal axis x shows 20 
the distance in light years. 
The left vertical line is the 
world line of Adam who stays 
on Earth. The right vertical 
line corresponds to the world 
line of Sirius. The world line 
of flying Bob is marked with 
a thick piecewise linear line 
given by equations t = 3x 
and tf = 20 — 3x. The future 10 
light cone t = |x| is marked ‘ 

by the dot-and-dash line and 

the dashed lines stand for 

events with simultaneous 

times in S’ and S$” such that ¢’ : 

and t” are constant \ 36 


16.4 


> 
x 


Another false argument is that when Bob changes the inertial system, he feels 
an infinite acceleration during the turnover at Sirius and that this is the main cause 
of the twin paradox. This is also not true. In more detail, Bob’s proper time t’ (and 
also t”) was not taken into account correctly as we shall see in Example 2.6. From 
Theorem 2.1 and Example 2.2 we know that the Doppler effect plays an important 
role in STR. Bob feels an infinite deceleration! during his turnover at Sirius. The 
reader is confused that this is the main reason for the twin paradox, since Bob 
changes its inertial system. However, this cannot be true. When Bob reaches Sirius, 
he can just transmit information (e.g. a photo) about his actual age in the system S’ 
to another traveler who is passing by Sirius and is fixed in the system S” associated 
with speed (—v). The same applies for Bob’s departure and arrival back to the Earth 
when he also feels an infinite acceleration or deceleration. 

To be more specific, assume that v = 0.8c. Sometimes it is also said that Bob 
will observe a large jump in time on Earth, since no event from the red interval (3.6, 
16.4) on the vertical axis in Fig. 2.6 is simultaneous with Bob. In Example 2.6, we 
will show that Bob will see only a jump in frequencies and no jump in time in $ 
during his turnover (see Fig. 2.7). 


' Roger Penrose [277, p.421] rounds the corresponding world lines on a small neighborhood 
around the three critical points to avoid an infinite acceleration or deceleration. 
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0 8 x/ ty 0 8 xi y 
Fig. 2.7 The notation is the same as in Fig. 2.6 except for dashed lines. Here dashed lines indicate 
trajectories of photons launched every year by Adam and Bob. Left: Adam sends a periodic signal 


to Bob. Right: Bob sends a period signal to Adam. There is no jump in time—only a jump in 
received frequencies 


2.6 Correct Solution of the Twin Paradox 


To solve the twin paradox we first prove the following theorem. 


Theorem 2.4 The difference (ct)? — x* is invariant with respect to the Lorentz 
transformation. 


Proof From (2.2) and (2.3) we see that 
(ct’)” -%7= (ct! — x’)(ct’ +. x’) 


UX Ux 
= Yo(ct = = vt) yo(et —-—+x- vt) 
c Cc 


v v 
= ye(1 + Jct (1 )ect + x) = (ct)? — x”. (2.19) 

c c 
Hence, this difference of squares is invariant with respect to the Lorentz transfor- 
mation. Oo 


Example 2.6 Let again v = 0.8c. (For general v € (—c,c) we may proceed 
analogously.) Since Bob is at rest in the system S’, we have that x’ = 0. Thus 
from (2.19) we get for At = 10 yr and Ax = 8 ly 


cAt’ = V(cAty? — (Ax)? = V100— 64 ly = 6ly. 
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Consequently, Bob will spend 6 years flying to Sirius according to his proper time 
(see the right part of Fig. 2.7). Thus his clock at Sirius will show 6 years. After the 
turnover, Bob will fly the same distance back to Earth at speed (—v) for another 6 
years with respect to his own proper time. On arrival, he will be 20 — 2-6 = 8 years 
younger than his twin Adam. 


Finally, let us see how the frequency of received signals changes. The Doppler 
effect, which is not included in the relations (2.2) and (2.3), cannot be removed 
during observations. Adam and Bob have the same clocks and let fo be their 
frequency at rest. When Bob moves away from Adam at speed v, then according 
to the relativistic Doppler effect Adam will detect the following frequency in S 


C—U 
f=, = 5 10: (2.20) 


Substituting v = 0.8c, we find that Adam will observe 3x lower frequency from 
Bob’s clock, since 


Similarly we find that after the turnover of Bob, Adam will observe the frequency 
to = 3fo, see (2.10) and Fig. 2.7. Adam will receive the same frequencies from 
Bob due to the relativity principle. Bob will see all instants on Earth even though 
with another frequency, i.e., no time interval will be skipped. 


Example 2.7 For simplicity assume that Niel Armstrong was flying to the Moon 
and back by the constant speed v = 10 km/s and that Ax = 384,000km is the 
mean distance to the Moon. When he returned from the Moon, how many seconds 
younger was he? 

His one way flight lasted 


Ax 


At = —— = 38,400 s (2.21) 
Vv 


with respect to Earth’s clock. Then by (2.20) we get 


Ax)2 
At! =,/(An? — ( a = 38,4002 — 1.282 s = \/1,474,559,998.3616 s 
Cc 


= 38,399.999,978,666 s. 


From this and (2.21) we obtain 


At — At’ = 0.000021344 s. 
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For the return trip we get the same value. Hence, according to STR, Armstrong has 
returned from the Moon 43 js younger than if he had remained on Earth. 


Example 2.8 Now let us take into account the Earth’s gravitational field and, for 
simplicity, neglect the Moon’s gravitational field. According to General Theory of 
Relativity (GTR), any clock on the Earth’s surface ticks slower than on the Moon. 
Let us ask the same question as in Example 2.7: How many seconds younger was 
Armstrong upon his return from the Moon? 

For the emitted frequency fem in the gravitational field and the detected fre- 
quency f in zero gravitation the following relativistic relation holds (see e.g. [327, 


p. 261]) 
f = fem, 1 — = (2.22) 


where we substitute for R = 6373 km the mean radius of the Earth and for 
r = 2GM/ c? = 8.87 mm the Schwarzschild radius of the Earth (cf. (3.10) below), 


where G is the gravitational constant and M = 5.972-10*4 kg is Earth’s mass. From 
this we get 

Jap 65g 

Sem 


During the time interval 2At = 76 800 s (see (2.21)), the clock on Earth is delayed 
by 
rd 
2A2(1 = =) = 50 ps. 
f 


em 


By comparison with the previous Example 2.7, we find that Armstrong returned 
even older than if he had remained on Earth. In this case, the effects of Special 
and General Relativity are subtracted from each other (similarly to the precise GPS 
setting). 


Example 2.9 Albert Einstein asked: Is a clock at a pole faster than one at the Earth 
equator? This example can be solved similarly as Example 2.8. Therefore, we will 
concentrate only on the main points. The equatorial radius of the Earth is Ry = 
6,378,135 m and polar Rz = 6,356,750 m. Assume that the clock on a pole has a 
frequency f = 1 s~!. The clock on the equator has the speed v = 27 R/T = 465.1 
m/s, where T = 86,164.1 s is the length of the sidereal day. We may suppose that 
they are in an inertial system on a very short time interval. According to (2.8), 
the frequency f’ satisfies f’ = % f, and thus for the difference of these two 
frequencies we find from (2.1) that 


, v2 {21 
f-fi=(1- 1-— ]f =1.2-10 Ss 
Cc 
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By (2.22) the frequency f of a non-moving clock on the equator with radius R, is 
f= 7Sta:ie 5 * 


higher than is the frequency f of the clock at the pole whose distance from the 
Earth gravity center is Ro, In this case, GTR produces a larger effect than time 
dilation proposed by STR. 


Finally, let us ask the question: For what speed v is the time dilation equal to 
the slower motion of a clock in the gravitational field of some body, e.g. the Earth, 
Mars, Jupiter? The answer is given by the next theorem. 


Theorem 2.5 Assume that an observer is far from all gravitational sources. Time 
on the surface of a spherically symmetric body B with mass M runs as fast as 
the observer’s own time moving relative to the body B at the escape velocity v = 


J2GMTR. 


Proof According to time dilation (2.5), the ratio of the corresponding frequencies 
satisfies the equality f’/f = y~!. Thus by (2.1) and (2.22) we require that 


ce | eee 
ce R 


5) rc 2GM 
v= — = ——_. 
R R 


Hence, we have 


The balancing of both the times therefore occurs for the speed 


/2GM 
v=4/-——. 
R 
This relation is the well-known escape velocity which is necessary to leave the 


gravitational field of a given body from the radius R. oO 


In the monograph [341, Sect. 3.5], one can find what the escape velocity means 
in General Theory of Relativity. 


2.7 Conclusions 


The Special Theory of Relativity has a number of unexpected claims that contradict 
our intuition. According to STR, no experiment can be made to decide whether the 
body is at rest or moving. All inertial systems for describing physical phenomena 
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are equivalent and there is no preferred inertial system. However, at present we 
know that the cosmic microwave background radiation (CMB) actually determines 
a certain kind of a fixed reference system in our neighborhood. Thus there arise 
speculations whether the principle of relativity in the physical universe holds. 


Remark 2.9 In the limiting case w = +c, Einstein’s formula (2.11) for v € (—c, c) 
gives u = c. Hence, every photon always has the speed of light in any inertial 
system. 


It is often said that the Lorentz transformation for low speeds |v| << c changes 
into the Galileo transformation 


This is not true (see [21, 355]), since for an arbitrarily small fixed v > O we can 
always find x such that the term vx/c? in (2.3) will dominate significantly over t. 
Nevertheless, from (2.2) and (2.3) it follows that the Lorentz transformation changes 
into the Galileo transformation for a fixed v, if we treat c as a parameter and assume 
that c — oo. In this case the term vx/c? becomes negligible. However, for an 
infinite speed of light there would be no Doppler effect nor aberration of light. 


Remark 2.10 For X = (x, y, z) and a constant velocity vector 0 € E? with length 
|v| € (0, c) the general Lorentz transformation is of the form (see e.g. [133, p. 434]) 


sf yrds % 5 

x =x+ re “x —yt)v, (2.23) 

f= y(t = —). (2.24) 
Cc 


Here the Lorentz factor y is defined similarly as in (2.1), only v* needs to be 
rewritten as |v|?. It is easy to find that for nonzero v = (v, 0, 0), where v € (—c, c), 
relations (2.23)—(2.24) change to (2.2)-(2.3). By Wikipedia [389] (see also [12]), the 
Einstein addition of velocities is neither commutative nor associative, in general. 


We conclude by stating that the longitudinal Doppler effect and aberration 
of light may cause that we observe completely opposite phenomena than those 
predicted by the Special Theory of Relativity by means of (2.2) and (2.3), 
see Examples 2.2 and 2.5. Note that relations (2.2) and (2.3) represent only a 
transformation of spacetime coordinates of points from one inertial systems into 
spacetime coordinates of the second inertial system. We saw that some other effects 
than time dilation and length contraction can manifest itself in a stronger manner 
and these effects cannot be shielded in any way. For a visualization of several further 
accompanying effects (like nonlinear distortion) we refer to [388]. 


Chapter 3 Mm) 
Einstein’s Equations om 


3.1 Historical Facts of Importance 


Karl Schwarzschild was probably the first scientist who ever realized that at any 
fixed time instant, our universe might be non-Euclidean and that it could be 
modeled by the three-dimensional sphere S° (called also a hypersphere) or the three- 
dimensional hyperbolic pseudosphere H*, see his paper [320] from 1900. Around 
1915, he became famous for calculating the first nontrivial solution of Einstein’s 
vacuum equations (cf. (3.6) below). 

Albert Einstein replaced the Newtonian gravitational force (1.9) by a curved 
spacetime, where all bodies move along geodesics—the shortest paths between two 
nearby points. This was a genial concept of how gravitational forces work (that 
resembles the well-known Fermat principle from optics). On November 18, 1915, 
Einstein submitted his famous paper [90] about Mercury’s perihelion shift. Here the 
gravitational field is described by the following equations 

Peel phe Os. Heras, (3.1) 
where the present standard notation and Einstein’s summation convention over all 
the values of repeated indices are used, 


4 1 ae 
r pv 78 (Zva,u + Sry, — Syv,r) (3.2) 
are the Christoffel symbols of the second kind (sometimes also called the connection 
coefficients) and gyy = Bale, x!, x?, x) are dimensionless components of the 
unknown 4 x 4 twice differentiable symmetric metric tensor (represented by a 
nonsingular matrix) of the spacetime of one time variable x° = ct and three space 
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variables x!, x2, x3, and 


det(g,)) = —1. (3.3) 


Furthermore, g/” is the 4 x 4 symmetric inverse tensor to g,,). For brevity, the first 
classical derivatives of a function f = f(x°, x!, x*, x3) are denoted as follows 
fx t= 0f/dx*. For simplicity, the dependence of all functions on the spacetime 
coordinates will often not be indicated. An infinitesimally small spacetime distance 
ds is usually expressed in the literature on General Relativity as follows 


dx 00 801 02 go3\ /dx° 
dx! | | g10 gu giz gis | | dx! 
dx? | | 920 g21 g22 g23 | | dx? 
dx37 \g39 931 932 933/ \dx3 


ds? = Gayle de” = 


Note that ds? means (ds)? and not d(s”). Let us also point out that the number of the 
Christoffel symbols appearing in (3.1) is 10(4 + 4-4) = 200. For the fundamentals 
of tensor calculus, we refer e.g. to [107, 249]. 

On November 20, 1915, David Hilbert submitted the paper [129] which was 
published on March 31, 1916. He did not require the validity of the restrictive 
algebraic constraint (3.3). Using a variational principle, he derived the following 
complete equations (see Fig. 3.1) 


Reel ag +04, —-M,0%, =0 (3.4) 


Te 


for the gravitational field, where the doubly underlined terms do not appear in (3.1), 
see Remark 3.2 below. Note that — {4} stands for the Christoffel symbol I”, 


Gravitationsgleichungen nur zweite Ableituangen der Potentiale 
9” enthalten, so mu8 H die Gestalt haben 


H= K+L 


wo K die aus dem Riemannschen Tensor entspringende Inyariante 
(Kriimmung der vierdimensionalen Mannigfaltigkeit) 


kK = Bo Ke 


m= 26) ate) te) 


Fig. 3.1 Hilbert’s original paper (see [129, p. 402]) contains the Ricci tensor R, from (3.4) which 
is denoted here by K,,, a K is the Ricci scalar. Hilbert’s paper was submitted for publication 5 days 
earlier than Einstein’s note [91] 
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in Fig.3.1. Hilbert’s main results are formulated in mathematical theorems with 
precise assumptions and statements. 

From (3.2) we observe that (3.4) is a system of nonlinear second order partial 
differential equations. Its left-hand side R,,, is at present called the Ricci tensor 
and the equations R,,, = 0 are called Einstein’s vacuum equations. By Sauer [317, 
p. 569], Hilbert’s knowledge and understanding of the calculus of variation and of 
invariant theory readily put him into a position to fully grasp Einstein’s gravitational 
theory. 

On November 25, 1915, Albert Einstein submitted a three and a half page note 
[91] which contains the same expression of the Ricci tensor as in (3.4), but the 
condition (3.3) is kept. The way in which Einstein’s equations (see (3.33) below) 
were derived is not described there. Einstein’s short note [91] was published on 
December 2, 1915, i.e. within one week. 

Now we show that the classical Minkowski metric tensor defined by 


Suv = diag(—1, 1,1, 1), (3.5) 
where all non-diagonal entries are zeros, is a trivial solution of Einstein’s vacuum 


equations. 


Theorem 3.1 The Minkowski metric tensor (3.5) is a solution of Einstein’s vacuum 
equations (3.4). 

Proof Since all Christoffel symbols (3.2) are zeros, the theorem immediately 
follows. oO 


In the same way we can prove that any other 4 x 4 constant symmetric tensor is 
also a solution of Einstein’s equations (3.4). 


3.2. Exterior Schwarzschild Solution 


On January 13, 1916, Karl Schwarzschild submitted the paper [321] containing the 
first nontrivial static spherically symmetric solution of Einstein’s vacuum equations. 
His solution is at present written as the following diagonal tensor 


di ( r—S r 
= diag( — ——, 
Bye é r r—S 


2, r? sin? 6), (3.6) 


where S > 0 is a fixed real constant and (r,0,q@) are the standard spherical 
coordinates such that 


x! =rcos¢cos0, x? =rsingcos0, x? =rsind, 
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r > S,@ € [0, 27), and 6 € [0, 2]. The time independent metric tensor (3.6) is 
called the exterior Schwarzschild solution of Eq. (3.4) and we have 


c7dt* = —c?(1 — S/r)dt? + (1 — S/r)7!dr? + r7d0? +r? sin? 6 d¢?. 


We see that for § = 0 the formula (3.6) reduces to the Minkowski metric tensor in 
spherical coordinates. In this case, (3.6) is called the isotropic metric tensor. 


Remark 3.1 On December 22, 1915, Karl Schwarzschild had already written a 
letter to Einstein announcing that he was reading Einstein’s paper on Mercury and 
found a solution of the field equations (see [90] for the English translation of this 
letter). Schwarzschild’s solution looks like (3.6). However, the meaning of the radial 
coordinate r is slightly different in his original paper [321] than in Theorem 3.2 
below. 


On March 20, 1916, Einstein submitted his fundamental work [92], where 
General Theory of Relativity was established. One year later, Einstein in [94] added 
the cosmological term Ag,,, to the right-hand side of his field equations (see (3.52) 
below) and in 1917 Willem de Sitter [80] found their first vacuum cosmological 
solution which describes the behavior of the expansion function on a closed universe 
with no mass. In 1932, Einstein and de Sitter [96] published another cosmological 
solution with positive mass. 

Classical relativistic tests [90, 237, 249, 324, 377] (such as bending of light, 
Mercury’s perihelion shift, gravitational redshift, Shapiro’s fourth test of General 
Relativity) are based on verification of very simple algebraic formulae derived 
by various simplifications and approximations of the Schwarzschild solution (3.6) 
which is very special and corresponds only to one spherically symmetric nonrotating 
body. For instance, the bending angle @ = 1.75” at the distance R = Ro is obtained 
from the relation 


Se 4GM 
~ ¢2R 


(3.7) 


which is derived from (3.6) under several simplifications, see [332] (cf. also [222, 
p. 11]). Relations (4.2) and (11.10) below represent further simple formulae that 
are tested. Therefore, in this chapter we will first present several theorems on the 
exterior Schwarzschild solution. 

Einstein and also Schwarzschild assumed that the gravitational field has the 
following properties: 


1. It is static, i.e., all components g,,, are independent of the time variable x 
2. It is spherically symmetric with respect to the coordinate origin, i.e., the same 
solutions is obtained after a linear orthogonal transformation. 
. The equations gp,0 = gop = O hold for any p = 1, 2, 3. 
4, The spacetime is asymptotically flat, ie., the metric tensor g,, tends to the 
Minkowski metric tensor at infinity. 


OW 
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Theorem 3.2 (Schwarzschild) The metric tensor (3.6) with S > 0 satisfies 
Eq. (3.4), but does not satisfy (3.1). 
Proof We will proceed in three steps: 


1. At first, we introduce the following definition of the Christoffel symbols of the 
first kind 


1 
Py = 5 Bau + 8va, — Buva)s A, u,v =0, 1, 2, 3. (3.8) 


From this and the relation g,,, = gy, we find that the Christoffel symbols are, in 
general, symmetric with respect to the second and third subscript 


1 
Pau = 7 (Savn + Spar — Svp,a) = avy. (3.9) 


Hence, their number is not 4 x 4 x 4 = 64 but only 4 x (1+2+3+44) = 40. 
Using (3.8) and (3.6), we obtain 


Teor = 2 _ Fe=r=a) 8 
001 = 3800.1 = aya = 52! 
T100 = —4 ae 
100 = — 7800,1 = 5p? 
Ti = 3 = 2 
Hl = 58 = Mr — Sy 
Ti2 = 5822.1 =r, 
1133 = — $8331 =-—rsin’6, 


P2021 = 3822,1 = 7, 
1233 = 3833,2 =r’ sind cos, 
1331 = 4933,1 =r sin’ 0, 
1332 = 5933,2 =r’ sind cosd, 
and the other components I,,,, = T',y, are zeros. 
2. Similarly to (3.9) we can find that the Christoffel symbols of the second kind 


are symmetric with respect to the second and third (lower) indices and by (3.2) 
and (3.8) we have 


ee = BOT pas 


44 3 Einstein’s Equations 


Since the metric tensor (3.6) is diagonal, its inverse reads 


r r—-S 1 1 ) 


"= diag( ; — 
. Fas PP r2 sin? 0 


and all non-diagonal entries are zeros. From this and Step 1 we get 


S 

Po. = pMPp ag, = 

ee a 2r(r — S) 
S(r — S$) 

r! = IIp = : 

00 — 8 100 = 
S 

Tr = IIp re, 

ee p 2r(r — S) 


Vy =e Tin =—-(r — $), 


r, = ¢'Tisz3 = —(@r — S) sin’, 
1 1 

Oh. mt 3222 _ _ 

M2 =8 Epa or = 


Pee = 91233 = sin@ cosé@, 


1 1 
r = gT331 = ———,_PL sin’ 6 — 
3 : ’ 
; r? sin? 0 r 
1 
3 33 ye 
Io = eo T3392 = a ee ord sin@ cos @ = cotan6, 
r- sin’ @ 


and the other components I”, = I"), are zeros. 


3. Finally, we will evaluate all entries of the Ricci tensor defined in (3.4). In 
particular, using formula for the inverse g“” in Step 2, we obtain 


Roo = M00,2-Tox,0 + Mole — Poel ho 


1 1 7 3 1 0 
=P ooi tl oo 2 + P43) — 2P ool 01 


S2r3 — S(r — S)6r? Se ) S(r — S) S 


4r® 2r3 r a. r 2r3 ar(r — S) 


rS 3S7—S), S(r-S)2 Ss? _ 
2r4 2r4 2r3 or) Ort 


0, 
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where the doubly underlined terms correspond to the doubly underlined terms 
in (3.4). From this it is obvious that the Schwarzschild solution (3.6) does not 
satisfy (3.1), since S(r — Sie #Ofor S > 0. 

Similarly, we find that 


Ry =e + ra 


1 0 1 2 3 1 /p0 2 3 
Sai ort P a Pea HP Fd wel et) 


0 po 2 p2 3 p3 
=~ Pal ot Pel te — Pil 3 


SQr—S) 2 S S 2 Se 2 
~F72 z+ ( 7 ) 2 2° 52 
2r-(r—S) r 2r(r—S)\2r(r—S) r 4r-(r—S) r 


_ px 4 Xr 54 Xr 54 
Roz = V9 pe -P a2 + poy — Ton y2 


1 3 1 72 3 2 pl 3 73 
= P99 1-932 +P an 42 + 43) — 20a P99 — M930 93 


1 2 2 2 
=-l1+- 5 (r — S)— + -(r — S) —cotan*é = 0, 
sin* 6 r oer 


R33 = 153 14,3 + 30 ay — Pa, 0% 


403 2 13 3 al 3 2 
= 13313 + 133093 — P31 33 — P39133 = 0, 


and the other non-diagonal entries of R,,, are also zeros. Therefore, (3.4) is 
satisfied. 
Oo 


The above theorem nicely demonstrates Schwarzschild’s ingenuity to find a non- 
trivial solution of a very complicated system of partial differential equations (3.4) 
for an arbitrary constant S > 0 (see also [107, 167, 249, 369]). For a spherically 
symmetric nonrotating object with mass M > 0, the constant 


_ 2MG 


° 2 


(3.10) 
c 


is called its Schwarzschild gravitational radius, where G is the gravitational 
constant and c is the speed of light in a vacuum. Applying the Newtonian formula 
for the escape velocity v = ./2MG/r from a circular orbit with radius r = S, we 
find that v = c. 
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Remark 3.2 Denote by 
Xr 
Sv = lage son pv oo 


the doubly underlined terms in (3.4). By Einstein [91] one additional algebraic 
condition (3.3) on volume surprisingly guarantees that the 10 differential operators 
Suv = Spy Simultaneously vanish when (3.4) is valid, see [107, p. 73]. The reason 
is that 7%, = (In./—g),,, where g = det(g,,,) = —1. Since In1 = 0, we find 
that P’,,, = 0, and thus (3.1) holds, i.e., the doubly underlined terms in (3.4) will 
disappear when (3.3) is valid. 


For positive numbers R < Q consider the spherical shell 
ies eer | Pee yee y+ey <0 


with interior radius R and exterior radius Q (see Fig. 3.2). Its volume in Euclidean 
space E? is equal to 


V= sO" ey: (3.11) 


Now we will derive a formula for the proper volume V of the spherical shell with 
coordinate radii R < Q ina curved space around a nonrotating ball with mass M 
and coordinate radius R. Here the tilde indicates a curved space. It is really curved, 
even though it is empty (modeling a vacuum). By (3.6) we find that the exterior 


Fig. 3.2 Spherical shell 
between two concentric 
spheres 
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space volume element is equal to 


dv =_/——ar- (rsin6 dd) - (r dé). 
r—S 


Therefore, by Fubini’s Theorem [296, p.581] the proper (relativistic) volume is 
defined as 


_ Q Tr 1 Qn Q r 
= r2 | ar. | (/ sin6 dp do = 4x [ r2 | dr. 
R r—S 0 0 R r—S 


(3.12) 


In [182] we prove the following astonishing theorem. 


Theorem 3.3 Let M > Oand R > S be any fixed numbers and let (3.10)-(3.12) 
hold. Then 


V-V>->wo aQ->ow. 


Proof The primitive function (i.e. antiderivative) to (3.12) is given e.g. in [120, 
p. 97] and by differentiation we can verify that 


2 2 3 
./ 7 me i SSr 58 5S = _ 
ye —ar=(5 + DD + 3 ) rir —S)+ 16 nQVr(r — $)+2r-—S). 
From this, (3.12), and (3.11) we get 


rs ae Ee 4 3 3 
V-V=4n r dr (QO R’)= 
R r—S 3 


4 5S 15S 
Z(G?) oor 5) len 2/0 -5)+20-5) 


3 4 


eae /R(R — S) ae (2,/R(R — S) + 2R — S) 


4 

~gi4 R’). (3.13) 
We see that the relation (3.13) was derived from the Schwarzschild solution (3.6) 
exactly without any approximations. The difference of the two terms containing 


In in (3.13) is positive, because 


Q>R>S8 
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and the logarithmic function is increasing, (Since In a—In b =In(a/b), the 
argument in parenthesis is dimensionless.) Thus from the inequality 


VO(O-S)>Q-S (3.14) 


we get the following lower bound 


2 2 2 
Ve One Sa) Hepes 


c, 
co 3s a 3 


where C contains all remaining terms not depending on Q and where C = C — 
1553/8. Letting Q — oo, we obtain the statement of the theorem. Oo 


Consequently, a natural question arises: How large can Q be so that the 
relativistic relation (3.12) approximates reality well. According to astronomical 
observations, space is locally almost flat at any fixed time, and thus to consider a 
very large Q makes good sense. 

From Theorem 3.3 we see that the difference of volumes V — V increases beyond 
all limits for Q — oo, which is a quite surprising property. Namely, this theorem can 
be applied for instance to a billiard ball or a small steel ball bearing (see Example 3.1 
below) or an imperceptible pinhead, since the fixed mass M > 0 can be arbitrarily 
small. Consequently, a natural question arises whether the exterior Schwarzschild 
solution (3.6) approximates reality well on large scales (see also [110]). 


Example 3.1 Setting M = 0.033 kg, R = 0.01 m, and Q = 5- 107° m, which is 
the radius of our Galaxy, we find from (3.10) that S = 5- 10-79 m and by (3.13) the 
difference 


V —V ®& 10000 km’. 


This is about 10!° times more than the volume of the ball itself. From this we see 
that the application of Einstein’s equations to galactic distances is questionable. 
Their application to cosmological distances may yield, by Theorem 3.3, surprising 
results, which may have nothing to do with reality (see also [198]). 


Remark 3.3 If M > 0 and Q > S are any fixed numbers such that (3.10) holds, 
then by (3.13) we get V < oo for R > S*,i.e., there is no singularity in volume 
near the ball with Schwarzschild radius. 


Remark 3.4 According to (3.6), the curved Schwarzschild space is in the radial 
direction described by the relation 
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By differentiation, we can check that 
/ dr = Vr 5) + Snr + Vr 3). 
r— 


Replacing the volume V by another expression V= $7((p(Q))> _ (p(R))>) in 
Theorem 3.3, we can similarly derive that V—-V => owas Q —> o, see [182]. 


3.3. Interior Schwarzschild Solution 


If o is the circumference of a nonrotating mass ball, then its apparent radius 


R= — 3,15 
a (3.15) 


is called the coordinate radius. 

In 1916, Karl Schwarzschild [322] found the first static nonvacuum solution of 
Einstein’s equations, cf. (3.16) below. He assumed that the ball with coordinate 
radius R > 0 is formed by an ideal incompressible nonrotating fluid with constant 
density to avoid a possible internal mechanical stress in the solid that may have a 
nonnegligible influence on the resulting gravitational field. Then by Ellis [98] (see 
also [105, p. 529], [332, p. 213], [390]) the corresponding metric tensor is given by 


. Fi I, Sr2 
Suv = aiag(—7 (3 x) <r ,r? sin 0), (3.16) 


where r € [0, R]. The tensor (3.16) is called the interior Schwarzschild solution, see 
Stephani [332, p. 213]. It is again a static solution, meaning that it is independent of 
time. To avoid division by zero in the component g1;, we require 


—Sr*>0  forallr € [0, R] 


which leads to the inequality 


R>S. 


We observe that for r = R the solution (3.16) for the interior of the ball continuously 
matches the Schwarzschild solution (3.6) defined outside the ball. We shall return 
to this important issue in Sect. 3.4. 
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From (3.16) we see that the corresponding time independent metric tensor 
(i.e. when dt = 0) satisfies 


R? 
di2 = naw dr? + r7d0? + r? sin? 6 dg” 
— Sr 
ar R3 as dr 
—~lao| | ® Sr? dé a7 
= 0 r- 0 ’ ( . ) 
do 0 0 r?sin?0) \dd 


where r € [0, R]. Now we will calculate the proper radius of the nonrotating ball 
defined by (3.22) below. For dé = 0 and d@ = 0 the equality (3.17) obviously 
reduces to 


pa (3.18) 
~ J —r?s?? 
where 
S 
si= rs (3.19) 
For r € [0, s~!) we can easily verify that 
1 
F(r) = — arcsin(rs) (3.20) 
S 
is a primitive function of 
1 
fv) = —_—.. (3.21) 


V1 —r2s2 


From the inequality R > S and (3.19) we see that 


/R -1 
R<Rj-—=s. 
S 


Hence, the primitive function F can be considered on the shorter interval [0, R] C 
[0, s—!). 

Using (3.18), (3.20), and (3.21), we now define the proper (relativistic) radius of 
the homogeneous nonrotating mass ball as 


wy [ dr 
R= ——_—.. 
0 1—r2s2 
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Fig. 3.3. Relativistic radius R 
is greater than the coordinate 
radius R = o/(27) 


From (3.20) we find that (see Fig. 3.3) 
* @... 
R= —arcsin(Rs) > R, (3.22) 
Ss 


since arcsinx > x for x € (0, 1]. 


Remark 3.5 The proper radius R represents a geodesic in a curved spacetime inside 
the nonrotating mass ball. It is the shortest path between any point on the surface of 
the ball and its center (0, 0, 0), cf. Fig. 3.3. As usual, the length of this geodesic is 
denoted by the same symbol R. For 0 <r, <1r2 < R we immediately get by (3.21) 
the inequality 


dr =1r2—T1. 


'2 1 rQ 
> | 
[ V1 — r252 rail 


From this we observe that the length element in the radial direction along Ris longer 
than the length element in the corresponding Euclidean space. 


Theorem 3.4 For a given fixed R > 0 we have 
R—>R_ fors > 0. 


Proof From (3.22) we find by Il’ Hospital’s rule [296, p. 388] that 


arcsin(Rs) i R 
im ————— = lim ——————. = R. 
s>0 Ss s>0 ./1 — (Rs)? 
The proof is completed. Oo 


From (3.10) and (3.19) we see that s — 0 if and only if M — 0. By Theorem 3.4 
the relativistic radius thus converges to the coordinate radius when M — 0. 
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The coordinate radius of a given fixed ball with circumference o is uniquely 
defined by (3.15), whereas its proper radius R is not uniquely determined. It depends 
on the mass M. The larger M is, the larger R is. 


Example 3.2 Consider a homogeneous ball with mass Mo = 2 - 10°° kg and 
coordinate radius 


Ro = 695,700 km (3.23) 


corresponding to the Sun. (Note that by the B3 resolution of the International 
Astronomical Union in 2015 (Hawaii) the radius of the Sun is given only to the 
above 4 significant digits.) Then by (3.22) we have Ro — Ro = 492 m. Hence, the 
associated Euclidean circumference is about 3 km shorter than 27 Ro; where Ro is 
the proper radius [163]. This relativistic effect is thus negligible when observing the 
Sun. 


Remark 3.6 In 1932, Lev D. Landau briefly mentions stars which could have the 
density of an atomic nucleus, see [212, p.228]. Two years later Walter Baade and 
Fritz Zwicky postulated the existence of neutron stars as remainders of supernova 
explosions and stated the hypothesis that this could be a source of cosmic rays 
(see [19, 20]). On August 6, 1967, Jocelyn Bell [127] observed periodically 
repeating signals at the Mullard Radio Astronomy Observatory in Cambridge (the 
radiotelescope was constructed by Antony Hewish). Then F. Pacini [267] and 
T. Gold [117] found that these signals are generated by rapidly rotating neutron 
stars. As of 2022 the fastest-spinning neutron star known is PSR J1748-2446ad, 
rotating at a rate of 716 times a second. A natural question arises: 


How far does its strong magnetic field act? 


Note that at the distance 70 km from the rotational axis the speed v of magnetic 
field lines in the equatorial plane would be 


v= 716s! - 22 -70km = 3.14- 10° km/s, 


which is more than the speed of light! Since magnetic field is a mass object, it may 
act only in very narrow relativistic jets along the rotational axis. Similar jets are 
observed also at black holes (see Fig. 12.4) and they could have the same origin. For 
the time being there is no generally accepted theory that explains the generation and 
collimation of these jets. However, it is not excluded that the kernel of any physical 
rotating black hole also produces strong magnetic fields, even though the theory 
admits only electric fields. 

Note that the current astrophysical hypothesis claims that the magnetic field is 
attributed not to the black hole itself. Rather, it is somehow generated from the 
plasma surrounding and falling into the black hole. 


Remark 3.7 Let us emphasize that one should carefully distinguish between an 
observed black hole and a theoretical black hole on a piece of paper. The interior of a 
physical black hole can contain a compact object with neutron star density (see [166] 
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and Fig. 12.2 below) instead of the proclaimed point singularity with infinite mass 
density. Infinity is only an abstract mathematical term. Moreover, it is important 
to realize that an arbitrarily large number is always in a close neighborhood of the 
origin on the real axis for a suitably chosen scale. 


At present, it is assumed that the core of a neutron star consists of cold quark- 
gluon plasma and that its mean density could be 2—3 times higher than the density 
of a neutron itself. 


Example 3.3 Consider an idealized homogeneous nonrotating neutron star with 
mass Mo and radius R = 10 km, which is in good agreement with [331]. Then 
by (3.10), (3.19), and (3.22) we obtain that R = 10.583 km (see Fig. 3.3). 


Theorem 3.5 If M > 0 is fixed, then the function R R— Ris decreasing and 
strictly convex on the interval (S, 00). 


The proof follows easily from (3.19) and (3.20) under the condition R > S$ 
(cf. Fig. 3.4). 


Theorem 3.6 The proper volume of a homogeneous mass ball with coordinate 
radius R > S > Ois 


2 
0 = S(R- RVI — Rs), 
AY 


where R is given by (3.22) and s by (3.19). 


Ee fer a a a ae | ae ee ET Lae | een 
= For an object with the Sun’s mass 4 
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Fig. 3.4 Dependence of the difference R — R on the coordinate radius R for the fixed mass M = 
Mo = 2- 10°° kg (see Theorem 3.5). Here Ro = 695700 km is the coordinate radius of the Sun. 
The radius Re = 6378 km corresponds approximately to a white dwarf and R, © 10 km toa 
hypothetical neutron star 
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Proof By (3.17) and (3.18) the volume element reads 


= dr 
dV = —_— . (rd) - (rsin@ dd). 
V1 —r2s2 ? 


By differentiation we can easily check that 


1 
H(r) = —<aresin(rs) — — V1 — 2s? 
253 Is2 


is a primitive function of 


r2 


V1 —r2s2 


on the interval [0, s~!), ef. [272, p. 298]. 
By Fubini’s Theorem [296, p.581] the proper (relativistic) volume of the 
homogeneous fixed mass ball can be expressed as follows 


7 R r? dr T 20 
em / sino ao [ dé = H(R)-2-2n 
0 0 0 


A(r) = 


Jl Pest 
= zl (ane Ri R?s?), (3.24) 
Ss 
Hence, by (3.22) we find that the theorem is valid. oO 


Theorem 3.7 Under the assumptions of Theorem 3.6 we have 
e A 
VoV:= Gas fors > 0. (3.25) 


Proof From Theorem 3.6, formula (3.22), and I’Hospital’s rule [296, p.388] we 
obtain 


R —(R— R?s”) + R38? 


arcsin(Rs) — RsV1 — R2s2 ; JT — R252 2a 
m = lim = —R?. 
s—>0 so s>0 352 3 
This together with (3.24) yields (3.25). oO 


Example 3.4 Consider a homogeneous ball with mass M = Mg = 5.97219 - 1074 
kg and coordinate radius R = Rg = 6378 km corresponding to the Earth. Then 
by (3.24) we get a surprisingly large relativistic effect, namely, 


Va — Ve = 457 km’, 
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where 
Ve ~ 10'? km?. (3.26) 


Theorem 3.8 If M > 0 is fixed, then the function R +» V/V is decreasing and 
strictly convex on the interval (S, 00). 


The proof directly follows from relations (3.19) and (3.24), and the inequality 
R>S. 


Remark 3.8 The convergence in (3.25) is monotonically decreasing. Hence, the 
proper volume V is larger than the coordinate volume V for M > 0, namely, 


V>V. 


The matter responds to gravitational compression in such a way that it increases its 
proper volume V when R is fixed and M increases. In other words, the larger M is, 
the larger V is for a fixed circumference and fixed V. Note that the mineral zeolite 
has a similar property due to its special crystal lattice. Whenever it is compressed, 
it increases its volume. Its atoms are arranged such that they have the maximum 
possible density. 


Example 3.5 For the Sun with coordinate radius Ro = 695700 km we obtain 
by (3.24) that (cf. (3.26)) 
Vo — Vo = 1.796-10'% km?, Vo — Vo = 1.197- 10! kin3, 


where Vo = 37 RB, and Vo — zm RB (cf. (3.26) and Example 3.2). For 
comparison, by integration of the vacuum solution we get by (3.13) that 


V—V =4.49-10!* km? 
for Q = Ro and R \, So, where 
So = 2.953 km (3.27) 


is the Schwarzschild radius of the Sun. 


Example 3.6 We will now show that there is a big difference between classical 
Euclidean geometry and the geometry of a curved spacetime inside high density 
objects. Consider an idealized neutron star described by a homogeneous ball with 
mass M = M, = 3Moe = 6- 10°9 kg and coordinate radius R = Ro = 10 
km (cf. Fig. 3.4). Then by (3.24) and (3.25) we get quite a large relativistic effect, 
namely, the ratio between the relativistic volume and Euclidean volume equals 


v= tk RJ1— S/R) = 1.666. (3.28) 
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Consequently, the relativistic volume V is 67% larger than the Euclidean volume V 
defined in (3.25). This means that the mean mass density 6 = M/V is smaller than 
p=M/V and (/p = 0.6. 


Since the density gradient of a physical neutron star is not known, formula (3.28) 
represents only an approximate lower estimate. Moreover, the relativistic volume 
extension could be sufficient to prevent density increase above the neutron or quark 
densities. The definition of mass density under strong gravitational field conditions 
is not an evident quantity, because the volume is influenced by the curved geometry 
of space under gravity. For derivations of density expressions under such conditions 
we refer to [104] (see also [100]). 


3.4 Composite Schwarzschild Metric Tensor 


For ro = R define the composite Schwarzschild metric tensor gyy by (3.6) for 
r > ro and by (3.16) forr ¢€ [0, ro]. It is easy to check that g,,, is continuous 
everywhere. However, we show that the first classical derivatives of the composite 
metric tensor have jumps. These derivatives appear in Einstein’s equations which 
are of the second order. 


Theorem 3.9 The composite Schwarzschild metric tensor gy, is not differentiable 
forr =ro. 


Proof We will show that the component gi; is not differentiable. From (3.6) 
and (3.16) we find that its first classical derivative does not exist at r = ro 
(see Fig.3.5). Namely, the component g11(7) of the interior solution (3.16) is an 
increasing function on [0, 79], whereas from (3.6) we see that the one-sided limit of 


81 


> 
ro r 
Fig. 3.5 The behavior of the non-differentiable component g1; = g11(r) of the composite metric 


tensor from (3.6) and (3.16). The first classical derivative (0g1;/dr)(ro) is not defined. The 
piecewise rational function g1; cannot be smoothed near ro, since then Einstein’s equations would 
not be valid in a close neighborhood of ro 
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the first derivative of the component g1;(7) = r/(r — S) of the exterior solution is 
negative 


3 
lim —(r) i 


+ Or 
r>1o 


The proof is completed. oO 


To show why the composite Schwarzschild metric tensor (3.6)+(3.16) is not a 
global solution of Einstein’s equation, we first present the following trivial example. 


Example 3.7 Consider the function u(x) = |x| on the interval (—1, 1). It solves the 
second order differential equation 


u(x) =0 


locally on the intervals (—1,0) and (0, 1). However, u = u(x) is not a global 
solution on the interval (—1, 1), since its first derivative u has a jump at 0. 


Thus from Theorem 3.9 we observe that the composite Schwarzschild metric 
tensor (3.6)+(3.16) is not a global solution of an idealized spherically symmetric star 
and its close neighborhood (see Fig. 3.2). All Riemannian spacetime manifolds have 
to be locally Euclidean which is not true in this case. Recall that an n-dimensional 
manifold withn € N = {1, 2,3, ...}1is aset of points for which there exists an open 
neighborhood that can be continuously mapped onto an open set in E” such that the 
inverse is continuous, too. An example of a manifold is the graph of a parabola, a 
hyperboloid of two sheets, the surface of a torus, and so on. On the other hand, the 
union of the hyperplane x; = 0 and axis x; in E” is not a manifold for n > 1. Also 
the set of rational numbers is not a manifold. 


Example 3.8 For comparison, note that the first order classical derivatives of the 
Newton potential uw of a similar problem are continuous. This problem is described 
by the Poisson equation 


Au = 41Gp, 


where p stands for the mass density. Let the right-hand side f = 42Gp be 
spherically symmetric and such that 


ifr é€ [0, 1], 
otherwise. 


1 
raf! 
The Laplace operator in spherical coordinates reads (see [296, p. 236]) 


Au 


02u 2 Ou ees 


au 1 du 
~ ar2. or dr or? ce ta ). 


062 00 sin? @ ag? 
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The term in parenthesis on the right-hand side is zero for the spherically symmetric 
case. By the well-known method of variations of constants, we find the following 
solution of the above Poisson equation 


ty i 
6 = 5 forr € [0, 1], 
u(r, A, op) = i 


—— otherwise. 
3r 


Hence, we see that both uw and du/dr are continuous at ro = 1. The second 
derivatives are square integrable in a neighborhood of rp = 1. 


3.5 Einstein’s Equations with General Right-Hand Side 


Einstein’s field equations (cf. [92]) without the cosmological constant are at present 
usually written in the following general form 


1 8G 
Ruy — 5 R8uv = —a Taw, p,v =0, 1, 2,3. (3.29) 
This is the system of 10 equations for 10 = 1+2-+3-+4 components of the unknown 
twice differentiable symmetric metric tensor g,,,. Now we describe the individual 
terms. The symbols R,,, stands for the symmetric Ricci tensor defined by (3.4), its 
contraction 


R= "Ruy (3.30) 


is called the Ricci scalar (i.e. the scalar curvature), and T,,) (= Ts &xp8av) 1s the 
symmetric tensor of density of energy and momentum. The left-hand side of (3.29) 


1 
Guy = Ruv = 7 R8uv 


is called the Einstein tensor. Contrary to the conventions in mathematical texts, the 
dependence of R,,, and R on g,,, is not indicated in the GTR. 


Remark 3.9 Let us emphasize that Einstein’s 10 equations (3.29) are not indepen- 
dent, since the covariant divergence of the right-hand side is supposed to be zero 
(see e.g. [249, p. 261)), ie., 


Thy ST +4 TY + TT! = 0, (3.31) 


where T4” = g*g*¥T,. and where all Christoffel symbols vanish when the 
spacetime is not curved. Note that the covariant divergence of the Ricci tensor R“” ; 
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is nonzero, in general. However, by the Bianchi identities (see e.g. [369, p. 147]) the 
covariant divergence of the whole left-hand side of (3.29) (i.e. the Einstein tensor) 
is zero automatically for g,,, smooth enough, e.g., if the third derivatives of g,,, are 
continuous (which is not the case in Fig. 3.5). Therefore, we have only 6 independent 
equations in (3.29). The number of independent components of the metric tensor is 
also 6, since one has to choose 4 coordinates. Relation (3.31) is called the law of 
local energy-momentum conservation, see [249, p. 386]. 

Recall (see [107, p.4]) that the principle of general covariance requires invari- 
ance of the form of physical laws under arbitrary diffeomorphic changes of 
coordinates. 


Finally note that the contravariant symmetric 4 x 4 metric tensor g“” in (3.29) 
which is inverse to the covariant metric tensor g,,, satisfies 


* 
Suv 


“te Oe ee ; 3.32 
det(gyv) et(guv) a ) 80v9 81 v1 8202 833 ( ) 


TES, 


ee 


where the entries eae form the 4 x 4 matrix of 3 x 3 algebraic adjoints of g,,y, S4 is 
the symmetric group of 24 permutations z of indices (vo, V1, v2, v3), sgn = 0 for 
an even permutation and sgnz = | for an odd permutation. 


Remark 3.10 Notice that the constant on the right-hand side of (3.29) is extremely 
small in the SI base units. Namely, 


8rG 8 - 6.674. 1071! 


re —43 -1,.-1.2 
aA x 37-1022 2-10 (kgm “s*), 


implying that the Ricci curvature tensor R,,, is also very small (if components of 
T,,v are not too large). 


Einstein in [91] presented the field equations (3.29) in an equivalent form that is 
presently written as follows: 


8G 1 
Ruv = a Taw = 5 8uv)s (3.33) 


where T := Tyyg’” = Te denotes the trace of T,,). To see that (3.33) is equivalent 
with (3.29), we multiply (3.29) by g“”. Then the traces of the corresponding tensors 
satisfy 


8G 
Cc 


Substituting this into (3.29), we get (3.33). 


60 3 Einstein’s Equations 


Now we present a quite surprising nonuniqueness property of Einstein’s equa- 
tions with nonzero right-hand side, in general (compare with sign conventions in 
[249, p.0]). 


Theorem 3.10 Jf g,,y is a solution of (3.29), then (—gyy) also solves (3.29). 


Proof From (3.2), we find that the Christoffel symbols of the second kind remain 
the same if we replace g,, by (—gyv), namely, 


1 
inten = 56 g)( Sva,u — Sauyv + Syv,r): 


Using (3.4), we find that the Ricci tensor R,» in (3.29) also does not change. 
Concerning the second term on the left-hand side of (3.29), we observe from (3.30) 
that ( - 5Reuv) also remains unchanged if we replace gj, by (—gyy). Oo 


Now we will derive the relation (3.4) from (3.29). 
Theorem 3.11 /f T,,,, = 0 then Einstein’s equations (3.29) imply that Ry, = 0. 


Proof Multiplying (3.29) by g”” and summing over all x and v, we obtain by (3.30) 
that 


1 


1 1 
0= g’Ryy — 5 Rat gu =R-— 5 RH, —R— 54k, 


where 54, is the Kronecker delta (i.e. 5% = 5! = 5%, = 5°, = 1 and 54, = 0 for 
je ~ v) and Oa =1+1+4+1+41=4. Thus, R = 0 and Einstein’s vacuum equations 
can be rewritten in the equivalent form 


Ru» =0, (3.34) 


where the unknown metric tensor g,,) is not indicated. Oo 


Concerning nonuniqueness, we observe from (3.2) and (3.4) that we can add any 
constant to any component gy, = gy, and Einstein’s equations Ry, = O will still 
be valid. 

In 1923, George David Birkhoff proved the following theorem (see [30]). Note 
that a similar statement was introduced by J@rg Tofte Jebsen two years earlier [142]. 


Theorem 3.12 (Birkhoff) Any spherically symmetric solution of (3.34) has to be 
static and asymptotically flat. 


This means that the exterior solution of vacuum Einstein’s equations (i.e. the 
spacetime outside of a spherically symmetric and nonrotating gravitating body) 
must be given by the exterior Schwarzschild solution (3.6). 


Remark 3.11 1n 1963, Roy P. Kerr found an axially symmetric solution of Einstein’s 
vacuum equations, see [144]. His solution (called the Kerr metric tensor) is a 
generalization of the Schwarzschild metric tensor when the central body has a 
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nonzero angular momentum. Kerr’s solution is described by two parameters: the 
mass M and the distance parameter a, where Mca is the angular momentum, 


A 2 
dr? = —(cdt — asin? dg)? + far? + p2d6? 
p 


sin? 6 


+ 
p2 


[(r? + a*)do — acdt}’, (3.35) 
where r, 6, @ are the standard spherical coordinates and 
A =r? —2MGr/c? +a’, p? =r? +a’ cos” 8. 


If a is positive, the metric tensor will be nondiagonal. For a = 0 and p > 0 we see 
that p = r and relation (3.35) reduces to 


Cdr? = (- ie oe ar i (1 = zi ) "dr? + r2d9? + r? sin? Odd. 
cr cr 

Using (3.10), we easily find that this leads to the Schwarzschild exterior solu- 

tion (3.6), i.e., if the rotation in the Kerr solution is put to zero, the Schwarzschild 

solution is retrieved. 

If 0 <a < S/2, then the radius U = (S + VS? — 4a*)/2 has a similar property 
as the Schwarzschild radius S given by (3.10). There exists a special surface defined 
by V = (S + VS? — 4a? cos? 6)/2, where 6 € [0, 7/2] is the angle from the 
rotational axis. The region between U and V is called an ergosphere. The behavior 
of geodesics lying in the equatorial plane is examined in [41]. 


From a mathematical point of view, it is confusing to call (3.35) a “metric”. The 
reason is that a function p : M x M > E! is called a metric on a manifold M if: 


1. p(A,B)>0 VA,BeM, 

2. p(A, B)=0 => A=B, 

3. p(A, B)= p(B, A) VA, Be M, 

4. p(A, B) < p(A,C)+ p(C, B) VA,B,C eM (triangle inequality). 


3.6 Black Holes 


A black hole is a region of spacetime where gravity is so strong that nothing (no 
particles or even electromagnetic radiation) can escape from it. A spherical surface 
with Schwarzschild radius (3.10) is called an event horizon. Its topology is S?. 
The curvature above the event horizon is usually described by the Schwarzschild 
metric tensor, or the Kerr metric tensor if the black hole is rotating, see (3.6) and 
Remark 3.11. 
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Stellar black holes are formed when very massive stars collapse at the end of their 
life cycle. For the origin of intermediate and supermassive black holes we refer to 
the paper [208] by Pavel Kroupa et at. Indirect evidence of their existence can be 
obtained from fast motions of orbiting stars, see e.g. [9, 319]. Anyway, for the time 
being no mechanism is known that some black holes grew to be a billion times the 
mass of the Sun just | Gyr after the Big Bang, even though the universe had much 
higher mean mass density than at present. These giant black holes could also be the 
remnants of the last Big Crunch. 


Remark 3.12 An English natural philosopher John Michell (1724-1793) was the 
first person who predicted the existence of black holes. He called them dark stars. 
In his 1784 paper [243], he writes: “..., if the semi-diameter of a spheere of the 
same density with the sun were to exceed that of the sun in the proportion of 500 
to I, a body falling from an infinite height towards it, would have acquired at its 
surface a greater velocity than that of the light, and consequently, supposing light 
to be attracted by the same force in proportion to its vis inertie, with other bodies, 
all light emitted from such a body would be made to return towards it, by its own 
proper gravity.” 


Example 3.9 Let us verify the above Michell’s original statement. The mass of a 
ball 500 times larger than the Sun with the same density is 500° Mo = 1.25-10°Mo. 
By (3.10) and (3.27) its Schwarzschild radius is § = 3.75 - 108 km. We see that this 
number is by (3.23) indeed comparable with radius of Michell’s ball r = 500 - 
695,700 km © 3.5 - 10° km. 


Remark 3.13 In 1799, Pierre Simon Laplace [215] came to a similar conclusion as 
Michell: “... the attractive force of a heavenly body could be so large, that light 
could not flow out of it.” In fact, Laplace also gave some details of his statement. In 
particular, from (3.10) we see that 


2MG 
C=. 


5 (3.36) 


This equality resembles the Newtonian relation for the escape velocity of an object 
with mass m from a ball with radius S and mass M, where 0 < m < M, see 
e.g. [184, p.52]. 


Remark 3.14 The origin of the term black hole is usually attributed to John 
Archibald Wheeler, see [375] which was published in 1968. However, already in 
1963 this term circulated during corridor conversations in the Texas Symposium 
on Relativistic Astrophysics in Dallas, see the historical survey [126]. Moreover, in 
1964 Ann Ewing published the paper [99]: “Black holes” in space. 


Finally, note that the entire universe should not be considered as a black hole 
as is sometimes claimed, since it has a different topology (S* or E* or H) than a 
three-dimensional ball in a curved spacetime (cf. Sect. 6.2). 
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3.7 Einstein’s Equations for Euclidean Space 


Throughout this section we shall assume that the spatial part of the 4 x 4 metric 
tensor is the following constant 3 x 3 diagonal tensor associated to the three- 
dimensional Euclidean space 


g°? = gun = dup andlet g® = g = g49 = gog =0 foralla, 6 = 1, 2,3. 
(3.37) 
That is, 
—g00 000 
0 100 
& = 6 101: (3.38) 
0 001 


However, goo = goo (x, x! x2, x3) may depend on all four coordinates x9, x1, x2, 


x°, in general, and for the determinant g of 8uv we have 
§ (= —800- (3.39) 


Note that the above notations of the tensor g,,, used in General Relativity differs 
from the standard notations in linear algebra, where g,,, denotes only one particular 
entry. The next theorem was inspired by the paper [190]. 


2 


Theorem 3.13 Assume that the metric tensor gyy = Zyy(x°, x!, x2, x9) satisfyin 
Su Su § 


Einstein’s equations (3.34) is of the form (3.38). Then 
_ 0 
800 = S00(x"), 


i.e., it is independent of x!, x7, x?. 


Proof First, we prove that the Christoffel symbols of the first kind will be zero for 
the case (3.37) if at most one index is zero. Since all nondiagonal entries of the 
metric tensor (3.38) are zeros and the time derivative of a constant is zero as well, 
we obtain that (see [173]): 


1. There are 6 = 1 + 2 + 3 independent components of Togg due to the 
symmetry (3.9) and we have 
1 
Toap = 7 (80a,p + g60,a — 86,0) =9 fora, B = 1,2, 3. 


2. There are 9 = 3 x 3 independent components of Fyog , 


1 
Teog = 7 (820.8 + 860,0 — 806.0) =9 fora, B = 1,2,3. (3.40) 
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3. There are 18 = 3 x 6 independent components of Fygy , 


1 
Vopy = 7 Sap.y + 8ya,p — &By,a) =9 fora, B,y = 1,2,3. (3.41) 
Second, we prove that the Christoffel symbols will be, in general, nonzero for 
the case (3.37) if at least two indices are zero. Hence, by the symmetry gy) = 
Sup, (3.8), (3.37), (3.38), (3.40), and (3.41) we have only the following nontrivial 


cases: 
4. By (3.39) there are 4 component such that 


1 1 1 
V0.0 = 7 (80x.0 + 800,x — 8x0,0) = 7800." = 5 8.x for x = 0, 1, 2, 3. 
(3.42) 
5. Finally, for the remaining 3 components we get 
1 1 1 
Teoo = 7 (8a0,0 + 00,0 — 00,0) = 7 800.0) = 78.0 fora = 1, 2,3. 
(3.43) 
So altogether we have evaluated 


6+9+18+4+3=40 


symbols I’,,,,,, but only 7 of them are nonzero, in general. 


The Christoffel symbols of the second kind are defined by (3.2). However, since 
in our case the metric tensor is diagonal (3.38), we find that 


MM =38"'Tiu, %, ft, ¥=0,1,2,3. (3.44) 


Here we do not use Einstein’s summation convention. Thus, by (3.2), (3.37), (3.39), 
(3.42), and (3.43) we get 


Pp =—Se foru=0,1,2,3, (3.45) 
& 


where g does not vanish, and 
a 1 
Moo = — 58.0 fora = 1, 2,3. (3.46) 


By (3.9) and (3.2) we obtain the symmetry in the last two indices of the 
Christoffel symbols of the second kind, 


My ee (3.47) 
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Further, we shall calculate the Ricci tensor. From (3.4) for the entry Roo we get 
Roo = V0 — F’ox,0 + Mole — Mola: (3.48) 


Below we shall not use Einstein’s summation convention for the index a = 1, 2, 3. 
For the first part of (3.48) with derivatives we obtain by (3.46) that 


3 3 
1 
0 0 
T00,% — T’ox,0 = (T'00,0 — F00,0) + » P00 = = S 8,00 » (3.49) 
a=1 


a=1 
since by (3.40), (3.41), and (3.44) we have 
ri, =0 torwa1,2.3,. A= 0,1,2,3. 


From this for the second part of (3.4) with nonlinearities we have by (3.39) 
and (3.45)-(3.47) that 


3 3 
ly 8 
0 0 0 , 
Mol. x — Tol On = Tool 20 Tool" 0A pa r da! “oo _ 4 = gy. 
a=1 a=1 
Substituting this and (3.49) into (3.48), we obtain by (3.34) the following second 
order partial differential equation for the unknown function, see (3.39), 


Pa ee 
namely, 
3 


Roo = > (8.00 - ear) =0 (3.50) 


a=1 


with the standard Laplace operator and the squared gradient. 

Now, we show that for the metric tensor (3.38) the entries Rog of the correspond- 
ing Ricci tensor are identically zeros without using (3.34). This means that we do 
not get any further condition on g. Using (3.4), (3.47), and (3.45), and the equality 
ee = 0 if at most one index is zero, we obtain for 6 = 1, 2, 3 that 


Xx x 
Rop = Pog. — Pog +P opP ax — Pol bn 
a0 0 0 10 i 0 
= Pop.0- TV 00,6 +P ogl 00 — Fool po = V 0g,0 — F 00,8 


0. 


(2) ~ (£2) _ _ 18,608 ~ 8.68.0 8.08 8 + 8.08.8 _ 
2\ ¢/0 2\ g/.8 2 g 
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Further, we find that 
Xr Xr 
Rap = Bx ~ Doce oe ab! ax =f oe we 


0 0 0 10 0 pO 0 0 pO 
= VP og.0— Pao, tPF apl 00 — Pao!” go = —V a0, — Pool go 


a 


= (2) 18086 _ 8ab8~8a8p  8a8,p 
2g/p 428 8 2g? 4g? 
3 
ee, apato8. 
2g 4g? 


From this and (3.34) we obtain 6 partial differential equations different from those 


in (3.50), namely, 


38.0 8p Z 


0, a, fp =1,2,3. (3.51) 
2g 


8,aB — 


Finally, we show that a general solution of the system (3.50) and (3.51) is 
independent of the space variables x!, x7, x>. Setting 6 := a and summing (3.51) 
over all aw, we obtain 


Ges = 


a=1 28 


Subtracting this from (3.50) and multiplying by g gives 


which yields 


Hence, a general solution is of the form (see (3.39)) 
g = B(x"), i.e. 800 = goo(x°). 


The proof is completed. Oo 


The class of all solutions of (3.50) and (3.51) is therefore not too rich when com- 
pared with Sects. 3.2 and 3.3. Theorem 3.13 may have applications in cosmology 
for a flat space with zero cosmological constant. 
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3.8 Cosmological Constant 


In 1917, Albert Einstein introduced a new form of his equations (see [94]) which 
are at present usually written as follows 


1 80 
Ruv = 5 R8uv + Any = ew (3.52) 


with non-zero cosmological constant A € E!. A positive value of this constant 
produces a repulsive force which does not allow a gravitational collapse of the entire 
universe which follows from (3.29). Einstein wanted to keep his original idea of 
a Static universe. We will derive a formula for the total mass of Einstein’s static 
universe in Remark 7.22. 

The left-hand side of (3.52) is divergence-free, since the Einstein tensor is 
divergence free G" i = 0 and the metric tensor is also divergence-free a = 0, 
see e.g. [107, p. 80] for the proof. Hence, the law of energy-momentum conservation 
remains upheld. 


Remark 3.15 It is said that the gravitational constant G is the worst established 
constant of all fundamental physical constants, since its value is known only to 
3 (or 4) significant digits. However, we do not know any significant digit of the 
cosmological constant A, yet. We even do not know whether it is positive or 
negative, if it exists, although there are hundreds of papers about this idea of 
Einstein. Some of these papers claim that the GTR is valid with precision 99 %. 
Nevertheless, the solution of Einstein’s equation essentially depends on A, see 
e.g. Fig. 6.7 below. The current standard cosmological model only assumes that 
A & 107°? m7, ie., an extremely small number in the SI base units that can 
play an essential role only on cosmological scales. This value can be obtained 
from (6.31), (6.29), and Remark 6.14. 


Theorem 3.14 Let gy, be a solution of (3.52). Then (—gyv) solves (3.52) if A is 
replaced by (—A). 


Proof We will proceed similarly as in the proof of Theorem 3.10. Using (3.2), we 
observe that the Christoffel’s symbols of the second kind remain the same if g,,y is 
replaced by (—gyv), 


1 
De = 56 ge \( 8vd,u Sdu,v + Syv,r)- 


By (3.4) we find that the Ricci tensor R,,) in (3.52) does not change as well. 
Concerning the second term on the left-hand side of (3.52), we see from (3.30) 
that ( - 5Repv) remains also unchanged if g,,, is replaced by (—g,). Finally, for 
the third term we get Ag, = —A(—gyv). oO 


Remark 3.16 According to the current standard cosmological model, dark energy 
is some mysterious substance which is responsible for the accelerated expansion 
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of the universe and this is attributed to the cosmological constant A. Its physical 
dimension is m~, since the left-hand side of (3.52) has dimension s~?. In spite of 
that, cosmologists describe it as the density of energy which has another physical 
dimension in the SI base units (International System of Units), namely kg m7!s~?. 
So can we talk about density of energy? 

In the system c = 1, meters and seconds can be arbitrarily exchanged using a 
suitable multiplicative constant. However, the physical dimension of A again does 
not correspond to the density of energy, since kilograms also do not appear there. 


Remark 3.17 Let us consider a special case of (3.52) when 
Tuy = 0. 
Similarly as in proof of Theorem 3.11 we find by (3.30) that 
O= gM” Ruy + (A — 5R)gh’ guy = R+A4(A— 5R) =4A-R, 

i.e., in this case Einstein’s equations (3.52) can be rewritten as 

Ruv = Agyy- (3.53) 
Remark 3.18 When Einstein’s paper [94] appeared, then shortly afterwards several 
articles (cf. [153, p.438], [374]) were published that looked for an exterior 


Schwarzschild-like solution of (3.53) using the standard spherical coordinates, see 
[167, Appendix C] for a detailed derivation, 


=—S§ Ar —S  Ar?,-1 
Sun = diag( (- - - ).C - ; ) 7? sin? 6, 7°), (3.54) 


where S is given by (3.10), r > S, and 6 € [0, 27]. We observe that (3.54) reduces 
to the exterior Schwarzschild solution (3.6) for A = 0. If A > O and S = 0 (e., 
M = 0 by (3.10)), we get the de Sitter metric tensor [80] (or the anti-de Sitter 
metric tensor for A < 0). Moreover, if A = 0 and S = 0 then (3.54) changes into 
the Minkowski metric tensor (3.5). 


Remark 3.19 Let A > 0 and let us define 
L:= AT, (3.55) 


Assume that S > 0 is such that S§ < L/2. Then 


S  4AL? 1. S 


2L)=-1 = 
g00(2L) cae : aT: 


> 0. (3.56) 


3.8 Cosmological Constant 69 


Fig. 3.6 Schematic uN 
illustration of the behavior of &, 00 
the metric coefficient 

200 = o0(r) from (3.54) that 
changes its sign when A > 0 


Hence, the continuous component go99 = goo(r) changes its sign on the interval 
(L,2L), i.e., there exists r € (L,2L) such that goo(r7) = O, see Fig. 3.6. 
Consequently, in the evaluation of the next component 


r—S — 
3 


gir) = ( 


one divides by zero for r = 7 and the metric tensor breaks down. The tensor (3.54) 
is thus not well defined at cosmological distances even though the cosmological 
constant was invented just to treat extremely large distances in the universe. The 
component g11(r) attains arbitrarily large values when r — 7” even though there 
is no physical barrier. Therefore, the quadratic form gy (r)v“v" is also not well 
defined at r = 7 for any contravariant vector v“ such that v! # 0. A different 
choice of coordinates does not help to avoid this singularity, since the form does not 
change its value for a given r. 


Chapter 4 ®) 
Numerical Analysis of Mercury’s cre 
Perihelion Shift 


4.1 A Brief Historical Overview 


The perihelion shift of Mercury’s orbit is thought to be one of the fundamental 
tests of the validity of the General Theory of Relativity [107, 249, 303, 312]. In 
the current astrophysical community, it is generally accepted that the additional 
relativistic perihelion shift (advance) of Mercury is the difference between its 
observed perihelion shift and the one predicted by the N-body problem, and that this 
difference equals 43” per century. However, as it results from the subtraction of two 
quite large and inexact numbers of almost equal magnitude which are not uniquely 
defined, this difference can be quite inexact. As such, the above accepted value 
43" cy! (are seconds per century) is highly uncertain and may not correspond to 
reality. In this chapter, we present a thorough mathematical and numerical analysis 
of diverse misconceptions about Mercury’s perihelion problem (mostly taken from 
[90, 164, 172, 174, 356). 

The French astronomer Urbain Le Verrier (1811-1877) is best known for 
his prediction of the position of Neptune (at that time an unknown planet). He 
performed his calculations to explain certain irregularities in the orbit of Uranus. In 
1859, Le Verrier also noticed some anomalies in the observed position of Mercury’s 
perihelion compared to Newton’s theory [220]. Again he started to look for another 
(still unknown) planet named Vulcan which should orbit the Sun more closely than 
Mercury. The existence of such a hypothetical planet deemed to be impossible at 
present, because we would have sometimes seen it when crossing the solar disk or 
during total solar eclipses. 


Remark 4.1 According to [346, p.36], Le Verrier needed to explain a shift of 
38” per century in order to make the predicted position of Mercury’s perihelion 
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using Newtonian mechanics to be in agreement with observations. In 1895, Simon 
Newcomb [260, Chapt. IX, p. 184] arrived at the comparably very small value of 


E = 43.37" per century. (4.1) 


In 1915, Albert Einstein published in [90, p.839] a formula for the relativistic 
perihelion shift, for one period, of 


e = 24n3 = 5.012- 1077 rad ¥ 0.1", (4.2) 


a 
T2c2(1 — e”) 
where according to contemporary data 


T = 7.6005 - 10° s is the orbital period of Mercury, 

e = 0.2056 is the (numerical) eccentricity of its elliptical orbit, 

a = 57.909 - 10° m is the length of its corresponding semimajor axis, 
c = 229,792,458 m/s is the speed of light in a vacuum. 


Substituting these values into (4.2), we obtain a value of 


180 
E= ex — 3600" = 43” per century, (4.3) 
ia 


that is in excellent agreement with the calculations of both Le Verrier and New- 
comb (4.1). Here t = 3,155,814,954s is the number of seconds in one century. In 
astrophysical community, the simple formula (4.2) is applied to test the validity of 
the General Theory of Relativity without any doubts how it was derived. Therefore, 
now we will point out several shortcomings which should be taken into account. 


Remark 4.2 In 1898, Paul Gerber derived the following formula for the speed of 
light by means of retarded potentials (see [113]) 


a2 


2 3 
a 
” PI e)@ 


’ 


where @ is the perihelion shift of Mercury during one orbital period. We see that 
this formula is the same as (4.2) taking 


eE=@. 


Therefore, the corresponding tests of General Theory of Relativity based on (3.4) or 
Einstein’s system (3.1) + (3.3) yield after many approximations the same value (4.3) 
as tests of Gerber’s theory of retarded potentials. So which theory do we test? 


We should stress here that over the past 100 years Mercury’s perihelion was 
shifted by a total amount of about 5600” (see e.g. [66, p. 363], [231, p.68]), of 
which approximately 5027” is due to the precession of the Earth’s axis. The vernal 
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ascending node 


vernal equinox 


trajectory 


Fig. 4.1 Keplerian parameters of the elliptical orbit of a planet that determine its orientation in 
space are: the inclination 7, the longitude of the ascending node 2, and the argument of perihelion 
a. They vary slightly in time due to the presence of other planets, precession and nutation of the 
Earth’s axis. The ecliptic is the plane containing the Earth’s orbit around the Sun 


equinox (one of the two intersections of the ecliptic and the celestial equator, see 
Fig. 4.1), from which the equatorial right ascension is measured, moves along the 
ecliptic by, on average, 50.27” per year. This value is also continually distorted by 
the nutation of the Earth’s axis of +£9.21” with a period of 18.6 years which does not 
divide 100 years. The accurate determination of the position of the vernal equinox 
is therefore a very difficult task. Moreover, due to non-zero inclinations of planets 
(Jupiter 1.3°, Saturn 2.5°, etc.), the position of the ecliptic in space changes with 
time. Thus, the use of a time-dependent equatorial coordinate system with right 
ascension and declination is not appropriate. 

Therefore, we will next consider a rectangular heliocentric system whose 
position is unchanged with respect to the fixed distant stars. In this system, the 
currently observed perihelion shift of Mercury due to the gravitational pull of the 
other planets is about 575” per century which is a more than ten times larger value 
than that in (4.3). On the other hand, according to the calculations of Le Verrier [220, 
p. 99], Mercury’s perihelion shift is influenced by the other planets as presented in 
Table 4.1. 


Remark 4.3 The total sum in the last line of Table 4.1 is to be understood only 
approximately, because the problem in question is nonlinear and such a sum is thus 
not justified. Additionally, although the other values in Table 4.1 are given in up to 
four significant digits, not all of them are accurate. For instance, at the time of Le 
Verrier, the masses of the planets were not precisely known: Mercury was believed 
to have double the mass assigned to it nowadays (using Kepler’s third law and the 
accurately measured orbital period of the Messenger satellite which orbited Mercury 
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oe ae Thee cla Planet | Its impact on Mercury’s perihelion advance 
particular planets on a —__— : booms 
Mercury’s perihelion shift per Venus | 280.6 
century by Le Verrier [220] Earth | 83.6” 

Mars 2.6" 

Jupiter | 152.6” 

Saturn 7.2" 

Uranus | 0.1” 

Total | 526.7” 


Fig. 4.2 Left: the idealized uniform perihelion shift of Mercury in the direction of circulation 
is shown. For clarity, a very high artificial eccentricity e = 0.8 is chosen. Right: an irregular 
perihelion shift caused by the gravitational tug of other planets is schematically depicted 


between 2011 and 2015), see [328]. On the other hand, the mass of the Earth was 
underestimated as 0.937 of its currently accepted value [220, p. 19]. Therefore, Le 
Verrier was not able to derive reliable values for the gravitational forces between the 
planets. Neither did he solve the system of ordinary differential equations describing 
the N-body problem (see (1.14)), but made approximations using certain sums of 
finite series instead by means of his perturbation theory. Finally, he only had relevant 
data concerning the positions of the planets over the preceding 50 years. 


Remark 4.4 The values in Table 4.1 are also affected by other sources of errors. For 
instance, the relative position of Jupiter in its orbit at the beginning of each century 
varies due to the simple fact that its orbital period of 11.861 years does not divide 
100 years. As the same holds true for the other planets, they all influence Mercury 
in an irregular manner from varying positions (cf. the right part of Fig. 4.2). Thus, 
the observed perihelion shift can essentially differ century by century. 


Our further aim will be to provide a more critical insight into the problem of 
Mercury’s perihelion shift. In Sect.4.2 we recall that the difference between two 
almost equally large numbers can be quite inaccurate. In Sects.4.3 and 4.4 we 
illustrate why the observed and calculated values of Mercury’s perihelion shift are 
highly imprecise. In Sect. 4.5 we show that the simple formula (4.2) due to Einstein 
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was derived from the 10 nonlinear Einstein equations (3.1) by many approximations 
without any guaranteed error estimates. In Sect.4.6, formula (4.2) is applied to 
other systems than the Sun-Mercury system. Finally, in Sect. 4.7 we repeat the main 
arguments concerning why the value (4.3) is questionable. 


4.2 The Inaccuracy of the Difference Between Two Almost 
Equally Large Numbers 


It is true that the manual calculations of Le Verrier and Newcomb (cf. (4.1)) led to 
values close to (4.3). However, they did not reach the accuracy of simple computer 
arithmetic in which each number is stored in six bytes, they originated from inexact 
data, and were obviously subject to various errors (see e.g. [137]). Actually, their 
modeling and discretization errors (see Fig. 1.9) were larger than the relativistic 
effect itself. Therefore, we restrict ourselves to the current status quo. 

Denote by the letter O the observed value of Mercury’s perihelion shift per 
century with respect to the Sun, and by C the calculated value per century in the 
heliocentric coordinates using Newtonian mechanics. At present, it is generally 
accepted by the current astrophysical community that the following equality is true 


O-C=E, (4.4) 


where E is the value (4.3) predicted by Einstein. For instance, Narlikar and Rana 
(257, p. 657] claim that 


O = 575" per century and C = 532” per century (4.5) 


which yields the value (4.3) by (4.4). However, they do not present any reference 
for the value O. Note that O cannot be by Remark 4.4 uniquely defined. The same 
is true for the calculated value C. 

For simplicity, most of the perihelion shifts will be from now on rounded to 
integers given in arc seconds. We shall not present the corresponding error bars, 
since they are always less than 1” in recent literature. It is clear that if at least one 
term in relation (4.4) is not correctly established, then the proclaimed equality (4.4) 
cannot be properly used and the value E may differ from reality. 

Numerical analysts know very well (see [45, 118]) that subtracting two almost 
equally large numbers (see (4.4)) in floating point computer arithmetic is burdened 
by a large resulting relative error. Although generally no rounding appears when 
subtracting such numbers, the mantissa of the difference contains only a few 
significant digits which inevitably leads to loss of precision. Let us present an 
illustrative example. 


Example 4.1 Assume for simplicity that the mantissa has only 5 digits. Let us 
evaluate the difference 3.1416 - 10° — 3.1415 - 10°. Then the computer saves the 
number 1.0000 - 10-4, where four zeros in the mantissa are not significant digits. 
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This means that the error of the difference moves from the originally last digit of 
the mantissa to the second digit. 


The following three sections show that all three values in (4.4) are loaded with 
plenty of different errors, and thus the proposed equality (4.4) is not very likely 
to be numerically verifiable. From Remark 4.4 we know that the observed (and 
also calculated) perihelion shift must differ century by century, cf. the right part of 
Fig. 4.2 and Figs. 4.4 and 4.5 below. Furthermore, we see that the full angle has over 
a million arc seconds, namely, 


u = 360 - 3600” = 1,296,000", 


while Eq. (4.2) gives less than one arc second per year. Mercury’s perihelion 
separation from the Sun is r} = a — ae = 46- 10°km. According to (4.3), the 
additional perihelion shift is 


2mr} ” 
—— - 0.43" = 96km per year, (4.6) 
u 


while in [249, p. 1048] the additional perihelion shift (4.3) is presented quite 
inexactly as 120 km per year. For comparison, note that the orbital speed of Mercury 
is about 50km/s. Hence, the time instant of the passage of Mercury through the 
perihelion should be established extremely precisely. 


4.3. The Observed Perihelion Shift of Mercury 


To determine the exact position of Mercury’s actual perihelion is one of the most 
difficult tasks in contemporary positional astronomy and computational mathemat- 
ics due to a multitude of reasons. In the case of a small eccentricity 0 < e < 1, we 
get an ill-conditioned problem, since for a circular path the perihelion occurs at each 
point. The eccentricity of Mercury’s orbit is a relatively large number e = 0.2056 
(see Table 1.1), but since the semiminor axis of Mercury has by (1.2) the length 


b=avV1—e- =0.98a, 


the orbit is almost circular with the Sun at one of the foci. 

Mercury can be seen only a few days per year in the projection on the shining 
celestial sphere. Hence, a 100 years ago, it was difficult to reliably determine from 
Earth its angular distances from neighboring stars on the celestial sphere. Since 
Mercury is mostly close to the horizon at sunrise or sunset, serious difficulties were 
also caused by the astronomical refraction of the atmosphere, see Table 4.2. 

Moreover, the refraction depends on the air temperature, pressure, humidity, etc. 
Notice that the value in the last row of Table 4.2 is given in arc minutes, and not 
in arc seconds (cf. (4.3)). It is even larger than the angular diameter 30’ of the 
Sun. Consequently, when the Sun touches the horizon at sunset and we would 
immediately remove the Earth’s atmosphere, the Sun would already be below the 
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Table 42 Influence of the Zenith distance | Mean refraction 
zenith distance on the mean a er 


re) / 
atmospheric refraction 0 (9 
20° 0/22” 
40° | 0/50” 
60° 1/43” 
70° 2/43" 
80° 5/30” 
90° 35’ 
P,P, 
Mercury 
Earth 


Fig. 4.3. Since the length of semiminor axis is 98 % of the semimajor axis, the orbit of Mercury is 
almost circular. The angle O corresponds to the observed Mercury’s perihelion shift per century. 
For a better visualization, the angular distance O between two perihelia P; and P in the figure is 
magnified 200 times. The observational data from the Earth must be recalculated to find the value 
of the angle O in the heliocentric coordinates 


horizon and darkness would occur. Let us note that the refraction of the solar 
atmosphere is ignored in another test of GTR—called bending of light near the 
solar disk which is more than hundred times larger than the Earth. 


Remark 4.5 To determine the exact present position of Mercury in heliocentric 
coordinates, we need to know precisely both the time-dependent right ascension and 
declination, but also the distance from the Earth, which currently can be obtained 
by using radar reflections [16, 143, 330]. However, these measurements were done 
in a time interval which is much shorter than one century, i.e., we do not have 
reliable data for one century. Moreover, we have to take into account that even the 
Earth moves during the measurements along a complicated nonelliptic orbit. The 
precise position of Mercury on the celestial sphere were also determined by satellite 
measurements (see Remark 4.3). 


Since the vertex of O lies in the Sun, this angle seems always to be much 
smaller when it is observed from the Earth (see Fig.4.3). In fact, the observed 
positions of Mercury in finitely many points have to be interpolated in a rather 
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Fig. 4.4 Projection of the 
trajectory of the Newtonian 
barycenter of the Solar 
system onto the ecliptic for 
the period 2000-2050. The 
center of the Sun (whose 
diameter is almost 1.4 million 
km) is at the origin of the 
heliocentric system. The 
barycenter shifts each day by 
about 1000 km, while the 
additional relativistic 
perihelion shift of Mercury is 
on average only 96 km ina 
year due to (4.6) 


2020 


complicated way and then recalculated to the heliocentric coordinate system which 
is fixed against distant quasars. Estimation of the acceleration of the Solar-system 
barycenter relative to a system of reference quasars is established in [347]. 

Nevertheless, here arises another serious problem, because the true orbit of 
Mercury is not elliptical. For illustration, consider first a simple two-body Sun- 
Jupiter system, where masses of the Sun and Jupiter are 


M = 1.989-10°°kg, m = 1.899. 107’kg, 


respectively. Assume that their trajectories are circular and that the barycenter of 
this system lies in the origin of Cartesian coordinates. Then the distances R = 
743, 000 km of the Sun to the origin and of Jupiter r = 778 - 10° km to the origin 
satisfy the relation MR = mr while the radius of the Sun is Ro = 695,700 km, 
see (3.23). Thus R = mr/M > Ro and we observe that the barycenter of the 
Sun-Jupiter system lies outside the Sun, and the Sun and Jupiter orbit about it. 

However, the Sun’s trajectory is also deflected by other planets (especially those 
very distant and massive), i.e., the heliocentric system is not inertial. Figure 4.4 
shows a two-dimensional projection of the trajectory of the Newtonian barycenter 
of the Solar system in the heliocentric system over the duration of half a century, 
starting from 2000. Thus, the actual trajectory of Mercury is not an ellipse, one of 
whose foci is placed at the Sun, but it is a very complicated nonplanar trajectory, 
which is influenced by the other planets and thus also by the barycenter of the Solar 
system. The barycenter shifts each day by about 1000 km which is a much larger 
value than the current shifts of the perihelion of Mercury’s orbit given in (4.6). 

Note that the largest elongations in Fig. 4.4 appear when Jupiter and Saturn are in 
conjunction with respect to the Sun every P = 19.856 years. Here P = 1/ Ge = 
Te), and Ts and 76 are the orbital periods of Jupiter and Saturn, respectively, see 
Table 1.1. 

It is therefore unclear how to define precisely e.g. the direction of the semimajor 
axis of Mercury (cf. Remark 4.17), which according to (4.2) should be shifted. It is 
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also unclear how to determine an analogy of the Newtonian barycenter of the Solar 
system for a finite speed of propagation of gravity. In this case, the problem becomes 
non-potential, which is a little known fact. 

Let us present other facts that need to be taken into consideration for determining 
the actual position of Mercury. 


Remark 4.6 Recall that the average speed of Mercury is about 50 km/s. Mercury’s 
diameter is nearly 5000 km and its angular diameter when crossing the solar disk 
is 12”. Before information about its position reaches our Earth at the speed of 
light, Mercury will move more than three of its diameters, ie. approx 40” and 
17,000 km (cf. (4.3) and (4.6)). But astronomical tables indicate planets where they 
are observed and not where they really are. 


To determine the actual position of Mercury it is necessary to take into account 
also the aberration of light, which for the Earth’s orbit is ~ 20”. Terrestrial 
measurements are also slightly affected by the atmospheric dispersion of light. 
Furthermore, from precise observations we know that the measured secular accel- 
eration of Mercury is proportional to its mean motion. Secular long-term deviations 
in the mean longitudes of this planet have reached 5” during the last 250 years [150, 
p. 884], i.e., after approximately 1000 periods. From this Igor N. Taganov [338, 
p. 72] derived that Mercury’s orbit slightly expands at the rate comparable to the 
Hubble constant. 

We should also take into consideration the calibration of instruments, which is 
essential when determining equatorial coordinates of Mercury and its distance from 
Earth. Determining the actual orbit of Mercury from astronomical observations is 
therefore an extremely difficult task which has the potential for different errors that 
have a significant impact on the value O from (4.4). 


Remark 4.7 Interpolation of measured position data represents another important 
source of errors. The inclination of Mercury’s trajectory with respect to the ecliptic 
is i = 7°, and therefore, we do not deal with a planar problem (see Fig. 4.1). 
From Sect.4.1, we know that positions of planets are considered in rectangular 
heliocentric coordinates (X, Y, Z). They are interpolated by a finite series. For 
illustration we show how to express, for instance, the X-coordinate of the Earth’s 
center in time f: 


X =0.0056114 + 0.001234 t + 0.9998293 cos(1.7534857 + 6283.075850 r) 
+ 0.000011 t cos(2.02 + 6283.1 £) 


38 40 
us >; A; cos(B; + C;t) + sm Ajt cos(B; + C;t), 
i=l i=39 


where the 120 constants A;, B;, C; are given by a table in [47, p.313] and ¢ is 
measured in thousands of years from the Julian date J2000.0. This is therefore 
an approximate analytical expression of X in the form of the sum of polynomials 
with trigonometric polynomials, which contributes to the total interpolation error 


80 4 Numerical Analysis of Mercury’s Perihelion Shift 


|X| — oo for t > oo. We see that the remaining components Y and Z, and also the 
heliocentric coordinates of other planets are parametrized similarly. The constants 
A;, B;, C; are corrected every few years so that they are consistent with the observed 
positions of the planets, cf. [46, 47]. However, in this way the true perihelion shift 
of Mercury is perturbed by interpolations. 


4.4 Computed Perihelion Shift of Mercury 


Figure 1.9 shows the general computational scheme for solving real-life (non- 
academic) problems of mathematical physics. We always produce three types of 
error: the modeling error eo, the discretization error e;, and computational error e2. 
In our case, the physical reality is the Solar system. Its evolution is usually modeled 
by the problem of N bodies that interact with each other gravitationally. Bodies are 
replaced only by idealized mass points mj > 0,1 = 1,..., N, whose positions r; 
satisfy the well-known system of second order differential equations (see (1.14)) 


Uli ri) 
pao mad areca (4.7) 
J#i 
fori = 1,..., N with given initial conditions on the positions 7; (0) and velocities 
r;(0) of all N bodies. Here | - | stands for the length of a vector and G is the 


gravitational constant. 


Remark 4.8 Henri Poincaré proved [292] that for N > 2 the N-body problem 
generally does not have an analytic solution. Therefore, the solution of (4.7) is 
usually approximated by numerical methods. This leads to the so-called discrete 
finite dimensional model, which produces a nonzero error e;. The N-body problem 
is not stable with respect to changes in initial conditions and continually acting 
perturbations. Thus each numerical integration method gives a considerable dis- 
cretization error over long intervals, even if we use double (8 bytes) or extended (10 
bytes) computer arithmetic. 


Further, we note that the error ez depends among other things on the computer 
used, programming language, and also on the method of programming. For a catas- 
trophic behavior of rounding errors in ill-conditioned problems, see e.g. [45, 162]. 
Numerical errors usually do not average out, but accumulate and grow exponentially 
in the course of calculation. Thus, each programmer (and also each observer) can get 
different results. For instance, in Russia completely different programs are used to 
calculate the ephemerides of planets [287, 288] than in France [47] or in the NASA 
Jet Propulsion Laboratory, Pasadena, USA, see e.g. [106, 329, 330]. 

The difference between the exact and approximate solution of a mathematical 
model is estimated in numerical mathematics by a priori or a posteriori error 
estimates. This step is very important. If we do not perform a reliable analysis of 
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Fig. 4.5 The shift of the line 20 
of apsides of Mercury’s orbit 
during 5 years only, 18 
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errors, we do not know what we have actually calculated, and how far the numerical 
solution is from the exact solution. 

For elliptic orbits the line segment connecting the perihelion and aphelion is 
called the line of apsides. However, if the perihelion shifts, then we have two 
different lines of apsides, the first connects the perihelion with previous aphelion 
and the second connects the perihelion with next aphelion. As already mentioned, 
the orbital periods of the planets are not in a ratio of small integers, and therefore the 
“line of apsides” rotates quite irregularly. This follows from numerical simulations 
of the N-body problem. For instance, by Rana [295], the perihelion shift may 
increase about 24” or decrease about 11” within less than 1 year, which has a 
nonnegligible influence on the average shift per century. From the highly fluctuating 
and chaotic behavior of the shift of the line of apsides from Fig. 4.5 it is obvious that 
the value C from (4.4) cannot be determined with an accuracy better than several 
arc seconds per century. Therefore, in the literature several different values of C can 
be found, for instance, 


C = 532” per century (see [249, p. 1113]), 
C = 531” per century (see [312, p. 1]), 
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C = 530” per century (see [356, p. 6]), 
C = 529” per century (see [295, p. 197]), 
C = 527" per century (see the last line of Table 4.1 taken from [220]). 


So which value is correct? In spite of this the difference E = O — C given by (4.5) 
has only one representation E = 43” per century in the currently available literature 
(see e.g. Table in [286, p. 1175]) and the Einstein value (4.3) is presented as accurate. 

Moreover, it is not clear what is the definition of the perihelion shift per century. 
If it refers to some kind of average value, then we have to define exactly what 
average. Over which time period do we perform averaging? We will illustrate this 
on Fig. 4.5, where the time interval is only 5 years long. 

Let us point out that Fig. 4.5 corresponds only to a restricted N-body problem, 
where the Sun is fixed at the origin of heliocentric coordinates. In the real N-body 
problem (4.7), the Newtonian barycenter of the Solar system (see Fig. 4.4) is placed 
at the origin of Cartesian coordinates. These two problems have, of course, different 
modeling errors. 

Elliptical orbits of test particles can only be obtained for the central force field, 
which is proportional to the gravitational potential 1/r. Nevertheless, this does 
not hold in the Solar system. For instance, the total weight of interplanetary dust 
around the Sun, which causes zodiacal light, is estimated to 10!© and 10!” kg. 
The book [303] admits that its impact is almost insignificant on the Mercury’s 
perihelion shift, together with the belt of asteroids, comets, etc. According to [312], 
the solar oblateness (quadrupole moment) contributes to the overall perihelion shift 
of Mercury only 0.0254” per century (see also [286]). On the other hand, a much 
larger value 3.4” per century is presented in Weinberg [369, p. 200]. He claims that 
the most probable reason for this effect is that the length of the rotational axis of the 
Sun oscillates. Also a differential rotation of the inner Sun may have a nonnegligible 
effect [81]. If the central core is an axially symmetric ellipsoid that rotates more 
rapidly than the surface of the Sun, then a part of the Mercury’s perihelion shift 
could be explained in another way. 

We should also take into account errors in the determination of physical constants 
(e.g. the gravitational constant G, the mass of the Sun and planets), the nonuniform 
influence of magnetic fields, tidal forces, gravitational aberration, etc. A large 
number of small errors can cause a nonnegligible total error e9 +e) + e2, see Fig. 1.9. 
Moreover, numerical results show that a finite speed of gravitational interaction also 
causes a shift of the line of apsides. 


Example 4.2. Consider only the isolated system Sun-Mercury. From (4.3) we get 
that the total relativistic aphelion shift of Mercury after 750,000 years is equal to 


750,000 - 0.43” ~ 90°. 


Using Table 1.1, we find that the distance between the Sun and the aphelion of 
Mercury’s orbit is 


a(1 +e) © 70- 10°km, 
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see the left part of Fig. 4.2. Here the aphelion of the Keplerian trajectory is the most 
right point. On the other hand, the most upper point corresponds to the relativistic 
aphelion shift after 750,000 years. The distance between these two points is /2-70- 
10°km ~ 100- 10°km. This simple academic example shows that the difference 
between the modeling errors of the two-body problem and its relativistic analogue 
can be very large on short time intervals. 


4.5 A Method of Albert Einstein 


If the General Theory of Relativity well describes the planetary motion, then the 
perihelion shift of Mercury calculated from Einstein’s equations for the whole Solar 
system should be close to the observed shift O from (4.5). However, Einstein’s 
equations for the Solar system are unlikely to yield the observed perihelion shift 
of Mercury O from (4.4) exactly, since any equation of mathematical physics is 
always just an approximation of reality and thus possesses a nonzero modeling error 
eo. Moreover, Einstein definitely did not solve his equations of General Theory of 
Relativity (3.1) with condition (3.3) for the Solar system analytically. These are 
represented by a very complicated nonlinear system of partial differential equations. 
Their exact solution is not even known for two mass bodies, since the left-hand side 
contains many thousand of terms (partial derivatives of scalar functions) as we shall 
see in Sect.5.1. This is due to the fact that there are, in general, 10 components of 
the symmetric metric tensor and 40 Christoffel symbols (see Sect. 3.1). 

Consequently, Einstein had to make a whole series of simplifications to get some 
value of the perihelion shift of Mercury (see (4.3)). In other words, formula (4.2) has 
not been derived as a consequence of Einstein’s equations in terms of mathematical 
implications. On the other hand, it could give a good prediction, if the corresponding 
approximations were under control. 

For instance, Einstein takes @ = 90° in spherical coordinates which corresponds 
to the equatorial plane and simplifies the calculation, cf. e.g. (3.6). 


Remark 4.9 Mathematical implications are different from consequences used in 
physics that may involve approximations. Let us present a very trivial illustrative 
example: If n = 8.999, then n is not divisible by 3. Now, let us approximate n by 
n = 9. Then we cannot claim that 7 is not divisible by 3. 


Einstein assumed that the curvature of space around the Sun is time independent 
and spherically symmetric. That is, the considered curvature of space does not 
include the gravitational influence of Mercury, Jupiter, and other planets to get a 
pure relativistic effect (cf. Fig. 4.2). Einstein considered the zero right-hand side of 
his field equations outside the Sun. For simplicity, Mercury is substituted by a test 
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particle with zero mass and its trajectory (geodesic) is described by 


dx” » dx? dx? 


ds2 + oT ds ds 


for arbitrary curvilinear coordinates x” (see [90, p. 835], [107, p. 3]). 


Example 4.3 Denote by Tp the orbital period of Mercury with zero mass and by T 
the orbital period of Mercury with actual mass m = 33.022-107* kg. Let us estimate 
the difference between their positions after one century. For M = Mo by Kepler’s 
third law a3/T? = {G(M +m)/n? we get T = 7,600,500s, cf. (4.2). Now from 
the relation T2(M +m) = 17M we find that 


To — T = 0.627s. 


During one century Mercury makes 
T 
n=— = 415.211 (4.8) 
T 


revolutions, where t and T have the same meaning as in (4.3). This corresponds to 
a total delay of 0.627n = 260.33 s if Mercury is replaced by a point with zero mass. 
During this time interval the difference between those two positions will be more 
than 13,000 km for Mercury’s speed of about 50 km/s. 


Einstein described the gravitational field around the Sun by a solution of the 
so-called weak gravitational fields, which uses the Parametrized Post-Newtonian 
(PPN) formalism for velocities v < c, see [249, p. 1069]. The fact that the speed of 
Mercury is considerably less than the speed of light enabled Einstein to perform a 
series of further simplifications. For example, Einstein neglected higher order terms 
when calculating Christoffel symbols. Hence, formula (4.2) should not be applied to 
strong gravitational fields as it is sometimes done. After five pages [90, p. 833-837] 
of further approximations Einstein got (without any error estimates) an ordinary 
differential equation for the angle ¢ whose solution leads to the elliptic integral (see 
Fig. 4.6) 


, (4.9) 


=[l+a(a)+ nf = 
a a ae MOY eer cme reer 


where @ > 180° is the angle between the radius-vector of perihelion and the radius- 
vector of aphelion of Mercury’s orbit (see Fig. 4.7), 


ee eee Tid ve, we “Ay =] 
ay = ——— = 1.432-10 m=, a= ——— =2.174-10 m 
a(1+e) a(1 —e) 


(4.10) 


4.5 A Method of Albert Einstein 85 


Fig. 4.6 Page 838 from the original Einstein’s paper [90] which contains an incorrect integration 
over the interval [a, a2] 


P 


Fig. 4.7 The letters S$, P, and A stand for the Sun, the perihelion, and the aphelion of Mercury’s 
orbit, respectively. The angle 4 PSA between the perihelion radius-vector and the aphelion radius- 
vector is denoted by @ 
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and by (3.27), 


M 
a= = 2953m (4.11) 
Cc 


is the Schwarzschild radius of the Sun. 
Because the integral in (4.9) has no known analytical solution, Einstein used the 
linear part of the following Taylor expansion 


: 1+ sax + 2022? + 
= AX ax iecs, 
Vl—ax 2 8 


which is a fairly good approximation on the interval [a , a2], since aaj « | for 
i = 1, 2. Hence, he expressed the angle ¢ from (4.9) as follows 


a2 d+ sax)dx 


=[l+a(o + 
iit eS Caracar 


? [+5 (1 + 02) | [+5 —| (4.12) 
= -—a(a; +a2)|=7 =——- |, ; 
fee 2a(1 — e) 
where the first equality should be replaced by © as explained above and the last 
equality holds due to (4.10). However, Einstein does not present any details how 


the above integration denoted by = was performed. We will postpone this important 
issue to Theorem 4.1. Note that the singularities near the end points a; and a2 
in (4.12) are integrable. 


Remark 4.10 From (4.12), (4.11), and Kepler’s third law a*/T* = GM/(4z7) it 
follows that after one period (more precisely, between two successive perihelion 
passages) the perihelion shifts about the angle 


2 ee ee en) 
CS ea) hg gy gee Pe 
i.e. the relationship (4.2), which according to (4.3) yields the idealized value of the 
relativistic perihelion shift of Mercury 43” per century (cf. the left part of Fig. 4.2). 
Here one again subtracts two numbers of almost the same size, namely ¢ — zr, see 
Example 4.1. 

All three of Kepler’s laws hold only approximately, since there is always a 
modeling error. Nevertheless, a natural question arises: Are we justified to use 
Kepler’s third law in deriving relativistic formula (4.2)? The radius vector of 
Mercury makes a larger angle than 27 between two passages of neighboring aphelia 
(or perihelia), see Fig. 4.7. Hence, T in (4.2) is not well defined. 
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Remark 4.11 It is curious that Einstein in [90, p.839] writes that astronomers 
observe also an additional perihelion shift of Mars’ trajectory of 9” per century, 
while his relationship (4.2) gives only 1’ per century. This large discrepancy is not 
mentioned in the current literature. In other words, the questionable value for Mars 
is suppressed, whereas the value for Mercury that fits to (4.1) is emphasized. 


Remark 4.12 A number of publications show (see e.g. [261, 312]) the value 
of (4.3) even to four significant digits 42.98” per century, when substituting more 
accurate values of a, e, and T to the simple algebraic Eq. (4.2). However, authors 
do not usually comment on the fact that this relationship was derived by many 
approximations of Einstein’s equations, i.e., the errors eg, e,, and e2 from Fig. 1.9 
are ignored. Thus, the value of Mercury’s perihelion shift (4.3) is not very reliable. 
Note that in [298, p. 1712] the value 


E = 42.9773350296' per century 


is presented, i.e., even to 12 significant digits! 


In 2010, Anatoli Andrei Vankov pointed out that there is a numerical error in 
Einstein’s evaluation of the integral appearing in (4.12), see [356, p. 21]. Further, we 
introduce a detailed calculation, so that everyone can check whether Vankov is right 
or not. The next statement is rather an example, but since it is important, we will 
call it a theorem. It yields a different value of ¢ than Einstein’s value from (4.12), 
see also Fig. 4.6. 


Theorem 4.1 We have 


nicotene: af” (1 + 3ax)dx 142% 4 vo, ee 
AA) + a2 =z —(a,+7ra2 —(a@1+a2 | 
a VR —o:)@ =a) 4 4 
(4.13) 
Proof Define the linear transformation €: [a1, a2] — [—1, 1] by 
x—S 
L(x) = ; (4.14) 
h 
where 
aes (4.15) 
2 
is the midpoint of [a@1, a2] and h = a2 — s half its width. Then €(a@,;) = —1 and 


£(a2) = 1, but also 


V-( = a1) (x — a2) = (hy + h)(hy — h) = hy 1 — y?, 
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where the substitution y = £(x) with inverse 
x=hy+s 


was used. Hence, dx = hdy and by (4.14) we get 


a d+ sax)dx _ (i % =) [ dy 4: ah [! ydy 
a WO = a1) = a2) 27 Jai lay 2 I aye 
as 
= (1+ —)x, (4.16) 


because the right-most integral exists over [0, 1] and cancels the one over [—1, 0], 
whereas the middle integral has arcsin y as antiderivative and thus evaluates to z. 
Multiplying (4.16) by [1 + a(a, + @2)] and applying (4.15), we find that 
a (1+ 5ax)dx 
ay SH = G1) @ = 3) 


a 
= [1+ aa + o2)Ix[ 1 + ya + @2)| 


[1+ a(a; + a2)] 


Sa a 
= x1 + gu +a2)+ oo +a2)"]. 


The proof is completed. Oo 


Remark 4.13 The correct integration from (4.12) gives by (4.13) that 


ov (1+ 5ax)dx 


=[1+ a 
o=[ a(ay + a2)] - /—(x% — a1) (x — @2) 


Sa a 2 
x1 + ym + a2) + re + a2) | 


x x1 + > ea + a2) | 


neglecting the quadratic term (of dimensionless order 107!>). This differs 
from (4.12). Hence, by (4.10) and (4.11) we get 


Sa _7 
é=2(¢-T)= m5 + a2) = 8.364-10°“ rad 
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for one period which is a different value than that in (4.2). This is 5/3 times the value 
that Einstein computed, see [172]. 

However, Vankov in [90] proposes to replace the factor | + a@(a@ + a2) in (4.13) 
by 1+ 5a(aty + a2) which then leads to the right-hand side of (4.12). 


Remark 4.14 The angle ¢ given by (4.9) is an approximation of another integral 


2 dx 
| | 
a, Vax3 —x2+Cx+D 


(4.17) 


where C and D are some suitable constants. According to Einstein [90, p. 838], a 
and a2 are two roots of the cubic polynomial 


ax? — x? 4+Cx+D=a(x —a)(x — a2)(x — B), 


where £ is the third root. From this we get —x* = —a(a; + a2 + )x? and thus 


1 
B=——-a,—- a2. 
a 


Einstein then approximates this third root by 1/a due to (4.10)-(4.11). Numerical 
integration based on the Chebyshev-Gauss quadrature formulae shows that 


o=[1+ Seta] [ a 
2 oa V—&% —a1)@ — a2) — ax) 


is a much better approximation of the integral in (4.17) than that of ¢ given by (4.9). 
This indicates that Vankov is right (see Remark 4.13 and also [139]). The equality 
sign = in the above integral should be again considered as ~. 

Einstein was likely satisfied that his value E = 43” per century agreed with the 
value obtained by Le Verrier (see Remark 4.1) and also Gerber (see Remark 4.2). 
Hundreds of publications claim that the value E = 43” per century (see (4.3)) is 
very precise without any proof, that it nicely fits to astronomical observations, and 
thus it perfectly confirms the General Theory of Relativity, see e.g. [87, 107, 152, 
154, 249, 257, 261, 268, 286, 295, 298, 303, 312]. Therefore, in the previous four 
sections we wanted to point out the main shortcomings which are not taken into 
account when investigating this ill-conditioned problem. 


4.6 Further Applications and Critical Remarks 


Einstein’s formula (4.2) is often applied to other bodies than Mercury. For example, 
substituting the values a = 9377km, e = 0.0151, and T = 7.65h corresponding 
to Phobos into (4.2), we can derive the quite large value of the pericenter shift of 
23” per century. This Martian moon is perhaps another suitable candidate for testing 
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the validity of General Theory of Relativity, since its trajectory is less disturbed by 
other bodies than for Mercury. A secular trend in Phobos’ longitude was already 
observed in [209, p. 1082]. For the innermost satellites of some other planets the 
formula (4.2) produces much higher pericenter shifts, e.g., for Jupiter’s moon Metis 
with a = 127,969 km, e © 0.04, and T = 7.075 h we get 1.37° per century. This is 
49” per year which is 115 times more than Mercury’s perihelion shift. Similarly, the 
pericenter shift of Io is 15” per year. These large values are difficult to verify due to 
various resonances between the Galilean moons. 

Einstein’s formula (4.2) has also been applied to the star $2 orbiting the 
supermassive black hole Sgr A* at the center of our Galaxy with pericenter speed 
12,000km/s. Its orbit is seen in the projection to the celestial sphere and the 
trajectory during the first observed period of S2 was measured only very roughly 
since 1992. Therefore, it would be more precise to measure the apocenter shift 
instead of the pericenter shift. The orbital period T of S2 is approximately 16 years, 
a ~ 970 au, and e © 0.885. Substituting these values into (4.2), we get 


en it, (4.18) 


The semimajor axis a was obtained by angular measurements. Nonetheless, since 
the distance 25.6 ly to Sgr A* is established only roughly, a more precise value of a 
is not known. 


Remark 4.15 Notice that the input parameters a, T, and e to the formula (4.2) are 
squared. Therefore, this formula is very sensitive to their accurate measurements. 
For instance, we have 


(a + Aa)? = a? + 2aAa + (Aa)’. (4.19) 


Hence, if the relative error in determining the value a is e.g. 1 %, then the error 
in a? is about 2% (we usually have |Aa| < a). All errors depicted in Fig. 1.9 
must also be considered. Furthermore, the rotation of the central black hole was not 
taken into account. There are, of course, many other errors coming from diverse 
sources like the pixel structure of CCD detectors, calibration errors, influence of 
neighboring stars, etc. Anyway, the authors of the paper [9] claim that the measured 
pericenter shift corresponds exactly to the value (4.18) and that this is another proof 
that General Relativity is valid (see also [392, 62 min]). It would be more precise 
to measure the apocenter shift. Note that (4.2) should be applied to a nonrotating 
central body while Sgr A* rotates. 


Remark 4.16 Itis curious that formula (4.2) always yields a positive perihelion shift 
€ > 0 even for the zero eccentricity e = 0. In this case, there are infinitely many 
perihelia, namely, the perihelion is at each point of a circular trajectory. In fact, 
by (4.2) we always have 


E> 24n3a°T~*c~* > 0. 
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However, this does not allow us to consider a retrograde perihelion shift with 
negative €, i.e., when @ < z in Fig.4.7. For instance, Andrea Ghez claimed that 
her team observed that the movement of the pericenter of the star S2 around the 
black hole Sgr A* is retrograde (see [391, 45 min]). Therefore, we suggest to wait 
several decades to find whether the formula (4.2) approximates reality well. 


Remark 4.17 For Newtonian elliptic orbits, the eccentricity is defined as follows 


where b is the length of the corresponding semiminor axis. However, what is 
the definition of e for non-elliptic orbits? (Cf. Fig.4.4.) How do we define the 
semiminor axis b for the situation sketched in Fig.4.7? Is b = |BD|/2 or b = 
|C D|/2? How to define B, C, and D at all? Should we define a as a = |AP|/2 or 
a = (|AS| + |PS|)/2 or by another manner? Formula (4.2) thus contains not well- 
defined quantities a and e. Moreover, the static space around the Sun is curved. So 
are a and q@2 in (4.10) well defined? 


Remark 4.18 Although formula (4.2) was derived for a massless test particle 
representing Mercury, it is often applied to close binaries whose components have 
positive masses M, and M2 (see e.g. [18, 40, 121, 131, 334, 336, 372]). Note that 
PSR J0737-3039 is the only known double pulsar. By Lyne [228] it has a relativistic 
periastron advance. The mass M = Mo in Remark 4.10 is usually replaced by the 
sum M, + Mp) using the PPN formalism, see e.g. [377, p.47]. However, the exact 
solution of Einstein’s nonlinear equations for two bodies with positive masses is not 
known. Hence, we cannot verify if such a simple approximation for large orbital 
velocities and strong gravitational fields is correct. 


Remark 4.19 Einstein’s approach does not take into account the influence of tidal 
forces, since Mercury is replaced by a point and the gravitational field around the 
“nonrotating” Sun is supposed to be spherically symmetric (it is approximately 
described by the exterior Schwarzschild metric tensor (3.6)). Therefore, no tidal 
bulges are possible. 

Tidal forces are not taken into account also in the classical N-body problem 
which yields the calculated value C. The actual trajectory of Mercury can be slightly 
deformed due to the locking resonance 2 : 3 between the orbital and rotational 
period. This resonance may have a nonnegligible influence on the actual perihelion 
shift as well. 


4.7 Conclusions 


In 1915, Albert Einstein derived the value of the additional relativistic perihelion 
shift of Mercury 43” per century. In his article [90, p.831] he claims that Le 
Verrier needed approximately 45” per century to explain the difference between the 
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observed and calculated perihelion position. This caused a sensation and Einstein’s 
General Theory of Relativity suddenly became famous. The value of relativistic shift 
(4.3), however, applies only to the very idealized case, which may be quite different 
from reality. It was obtained by many approximations, assumptions and also several 
errors, see [90] for their list. 

According to [139], in 1913 Einstein wrote a manuscript on the motion of the 
perihelion of Mercury together with his friend Michele Besso. Although Besso 
corrected many errors in their common calculations, Einstein does not mention him 
in the final version [90]. 

Hence, Einstein worked on the problem of Mercury’s perihelion shift for at least 
2 years. Nonetheless, when [90] has appeared, he also claimed that: Ich habe die 
Gerbersche Arbeit urspriinglich schon deshalb nicht erwdhnt, weil ich sie nicht 
kannte, als ich meine Arbeit tiber die Perihelbewegung des Merkur schrieb, which 
can be translated as follows: I have not mentioned the work by Gerber originally, 
because I did not know it when I wrote my work on the perihelion motion of Mercury. 
It is, however, interesting that Einstein used a very similar notation as Gerber in 
[113], namely, the angle ¢ from Fig. 4.7 satisfies ¢ = ®/2, where ® appears in 
Gerber’s formula, see Remark 4.2. 

It is very difficult to separate relativistic effects from effects of Newtonian 
mechanics of similar size, and many other approximations. In formula (4.4), two 
almost equally large numbers are subtracted that are, in addition, perturbed by 
various errors. In other words, we do not evaluate the difference O — C, but O-C 
without any guaranteed error estimates, where O = O and C & C which are 
also not uniquely defined. The estimated difference of Mercury’s perihelion shift 
obtained from astronomical observations and numerical solution of the problem of 
N bodies is therefore ill-conditioned, see [356]. According to [312], formula (4.2) 
represents a weak experimental and theoretical confirmation of General Relativity. 

The main objections against the proposed value (4.3) can be summarized as 
follows: 


1. The input parameters a, e, T from (4.2) are not well defined, see Remarks 4.10 
and 4.17. 

2. The formula (4.2) is quite sensitive to the accuracy of all input data due to the 
relation (4.19), see Remark 4.15. 

3. Since Mercury’s orbit with b = 0.98a is almost circular, to determine 
accurately the exact position of perihelion is an ill-conditioned problem, see 
Sect. 4.3. 

4. Since the proposed relativistic perihelion shift 0.43” yr~* is a quite small 
number, all error estimates in deriving this value should be done very carefully. 
However, this is not done in the literature. 

5. The modeling error (see Fig. 1.9) of Einstein’s equations is not known. 

6. The observed perihelion shift O from (4.4) is not uniquely defined, see 
Sect. 4.3. 
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7. The calculated perihelion shift C from (4.4) is also not uniquely defined, see 
Sect. 4.4. Moreover, the subtraction O — C of two numbers of a similar size is 
burdened by a large relative error, see Sect. 4.2. 

8. The formula (4.2) does not allow a nonpositive shift, see Remark 4.16. 

9. The tidal forces of Mercury and the Sun are ignored, see Remark 4.19. 

10. The barycenter of the Solar system shifts every day by about 1000 km, whereas 
the additional relativistic perihelion shift of Mercury is on average only 96 km 
in a year, see Fig. 4.4. 


The Einstein value E from (4.3) was derived under many simplifications and 
approximations without any error estimates, see Sect. 4.5. The list of these simpli- 
fications would be too long, e.g., a special form of the metric tensor is prescribed, 
the real rotational axis of the Sun is not perpendicular to Mercury’s orbit, Lense- 
Thirring precession effect is ignored, the spacetime curved by Mercury, Jupiter etc. 
is replaced by Euclidean space, the PPN formalism is used for velocities v < c, 
higher order terms are neglected. Retardation effects of gravitational interaction on 
the perihelion shift of Mercury are studied in [382]. 

In this chapter, we primarily wanted to point out that we should not indiscrimi- 
nately take the value (4.3) and spread it further without checking how it was derived. 


Chapter 5 m®) 
Computational Problems of Einstein’s ore | 
Equations 


5.1 On the Explicit Form of the First Einstein Equation 


In this section, we want to point out the extreme complexity of Einstein’s equations 
which does not allow us to find their analytical solution of the N-body problem 
for N > 1. Applicability of Einstein’s equations in the relativistic community is 
therefore overrated. 

In (3.29), the dependence of the Ricci scalar R and the Ricci tensor R,, on the 
metric tensor g,,, is not indicated. Consequently, Einstein’s equations (3.29) seem 
to be quite simple. In [249, p. 42], one even can read: 


No equation of physics can be written more simply. 


To avoid this deceptive appearance, we will show now how to derive an explicit 
form of the first Einstein equation of system (3.29). The form (3.29) is evidently not 
suitable for computer implementation. 

First we shall consider only the case when T,, = 0 with zero cosmological 
constant which is due to Theorem 3.11 given by the relation 


Ruy = 0. (5.1) 


Nevertheless, this apparently very simple equation takes an extremely complicated 
form when the unknown metric tensor g,,, is expressed explicitly as we shall see 
below. The reason is that mathematicians mark all unknowns in the equations they 
solve, whereas in (5.1) no unknowns and also no coordinates are indicated. 

So now we will rewrite (5.1) such that the metric tensor explicitly appears there. 
Using (3.4), we can express the first Einstein equation of (5.1) as follows 


0 = Roo = Tox — Mox,0 + Poole — owl on 


0 1 2 3 0 1 2 3 
=P 0,0 + F001 + 10,2 + 00,3 — 00,0 — F 01,0 — F’02,0 — 103,0 
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0 0 1 2 1 0 1 2 

+P oOo + Pon + Poo + Pog) + Poo 9 + Pa + PA + P43) 
2 0 1 2 3 0 1 2 

+ D909 +P yy + P99 + P93) + Pog (PG9 + P51 + 752 + P43) 


0 pod 0 1 0 p2 0 3 
— Pool 00 — Mol 00 — M02! 00 — M03! 00 


2 10 2 pl 2 p2 2 73 
— Tool 02 — Por 02 — Moa! 02 — Pos! 02 
3 pd 3 1 3. p22 3 3 
— Mool™o3 — Port 03 — Moal03 — Mosk 
where the underlined terms cancel. Hence, the first Einstein equation can be 
rewritten by means of the Christoffel symbols of the second kind in the following 
way: 
0 = Roo 


=a 2 3 1 2 3 
=P 0,1 + P00,2 + 00,3 — Fo1,0 — F2,0 — P03,0 


+ PGF or + Poo + P'3) +P oo(—P 9 + + P12 + P43) 
2 0 1 2 3 3 0 1 2 3 
+ Pog(—I'a9 + Pay + M92 + P93) + Pog (—M'g9 + P'31 + 199 + 133) 


~ 20 olor 7 2ro3lo1 ~ 203! 02 ~ (Co _ (T) ~ (P'3)”- 
(5.2) 


Using (3.2) and the symmetry of g,,,, we obtain 
7) aie = a (aun + 8v0,u — Spv,0) oF e uiy + 8vl. — Sp, 1) 
+ 2 (py + 802.4 — Suv,2) + po Gay + 8v3,u — Spv,3) 


and thus by (5.1) we can express the first Einstein equation Rog = O by means of 
the metric coefficients and their first and second order derivatives as follows: 


0 =4Roo 
= 2[ g'? g00,0 + Ca (2801.0 — 00,1) + g') (2802.0 — g00,2) 


ts oa (2803,0 — 800,3) 


+ g!° 200,01 + g!!(2g01,01 — 800,11) + g'°(2802,01 — 800,21) 


+ g(2g03,01 — 800,31) 
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+ 3°)'g00,.0 + 83 (2801.0 — 800.1) + 873 (2802,0 — 800,2) + 873 (2803.0 — 800.3) 
+ ¢”° g09,02 + 87! (201,02 — 800,12) + 8” (2802,02 — 800,22) 


+ g73(2g03,02 — 800,32) 


+f 83 800,0 + v3 (2g01,0 — 00,1) + 5 (2802,0 — g00,2) + gs (2803,0 — 00,3) 
+ 9°" 800,03 + 8°! (2g01,03 — 800,13) + 8°” (202,03 — 200,23) 


+ g°3(2g03,03 — 200,33) 


= 81) 800.1 = 8 S11,0 = 819 (802,1 + 12,0 — 801,2) 
= 819 (803,1 + 13,0 — 01,3) 
— g'°g00,10 — 8'! 811,00 — 8'*(802,10 + 812,00 — 801,20) 


— g!3(go3,10 + 213,00 — 801,30) 


cs 8 00,2 = go (g01,2 + 821,0 — 02,1) — 8% 22,0 
= wa (g03,2 + 23,0 — 02,3) 
= g7° 900,20 — 87! (801,20 + 821,00 — 802,10) — 8°" 822,00 


— g°3(g03,20 + 223,00 — 202,30) 
= 8°) 800,3 — 89 (801,3 + 831,0 — 803.1) — 86 (802,3 + 832,0 — 03,2) 
— 8° 833.0 
30 31 32 
— 8” 800,30 — & (01,30 + 831,00 — 803,10) — 8” (802,30 + 832,00 — 803,20) 
= °° 833,00] 
+ (3° g00,0 — 8°! 00,1 — 8°” g00,2 — 8°° 80,3) 
x [g!°(2g10.1 — 8110) +8) 8111 +8? (2812.1 — 811.2) + 8° (2813.1 — 811,3) 


+ g°°(g10.2 + 920.1 — 812.0) + 8 gi.2 +. 8°" 8221 


+ 97 (913,2 + 923.1 — 812.3) 


+ ¢°°(g10,3 + 830,1 — 813.0) + 8° 811.3 + 8° (812,3 + 832,1 — 813,2) 
+ g933,1] 
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+ (g'°g00.0 +g! g11.1 — 88112 — 8! 811,3) 


x [= 8°" 800,1 — 8° 811.0 — 8°" (812,0 + 802,1 — 810,2) 


— g° (913.0 + 803.1 — £10,3) 
+ g!°(2g10,1 — gi10) +8 gi +8 (2812.1 — 811,2) + 83 (2813,1 — 11,3) 


+ 97° (g10,2 + 920.1 — 212.0) + g7 811.2 + 87" 822.1 


+ 97 (913,2 + 923.1 — 812.3) 


+ ¢°°(g10,3 + 830,1 — 813.0) + 8° 811,3 + 8° (812,3 + 832,1 — 813,2) 
+ g933,1] 


+ [7° g00,0 + 87! (2g01,0 — 800.1) + 8” (2802,0 — 80,2) + 87° (2803,0 — 800,3)] 


x [ - 2° 800.2 — g°" (g21,0 + 801,2 — 820,1) — 8°" 822,0 


— ¢°3(g03,0 + 803,2 — 820,3) 

10 u 12 B 
+ g °(820,1 + 10,2 — 821,0) +8 8112+ 8 “8221 + 8 ~ (23,1 + 813,2 — 821,3) 
+ g°°(2g00,2 — 822,0) + 87 (2821,2 — 822,1) + 8°82,2 + 87° (2893,2 — 822,3) 


+ g°°(920,3 + 830,2 — 8230) + 8°! (821,3 + 831,2 — 823,1) + 8822.3 + 8833.2 | 


+ [3°°200,0 + 8°! (2g01,0 — g00,1) + g°°(2802,0 — 00,2) 


+ g°°(2g03,0 — g00,3)| 


x [ = 8°" 800.3 — 8°! 801,3 — 8°" (832,0 + 802,3 — 830,2) — 8°°833,0 
+ g!°(g30,1 + 8103 — 831,0) + 8! 811.3 + 8 '°(832,1 + 212,3 — 831,2) + & 8331 


+ 979°( 239.2 + 920,3 — 832.0) + 87! (831,2 + 821.3 — 32,1) + 88203 + 8° 833.2 


+ ¢°(2¢30,3 — 833,0) + 8°! (2g31,3 — 33,1) + 8° (2832,3 — 33,2) + 8°°833,3] 
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— 2[g'g00,2 + 8 (8o1,2 + 821,0 — 02,1) + 8 82,0 

+ g'°(g03,2 + 823,0 — 802,3)| 


x [e°° 80,1 + 97! 11,0 + 9°°(g02,1 + 12,0 — 801,2) + 8 (803,1 + 813,0 — go1.3) | 


— 2[¢!°g00,3+8'' (g01,3 + 831,0 — 803, +8 '*(802,3 + 832,0 — 803,2) +8 °833,0] 


x [87° 800.1 + 8°! 811.0 + 8°°(802,1 + 812,0 — 801,2) + 8°°(803,1 + 813,0 — 801,3)] 


— 2[ 27° 00,3487! (go1,3 + 831,0 — 803, +8" (g02,3 + 832,0 — 803,2) +8 233,0] 


x [97° 800,2 + 8°! (go1,2 + 21,0 — 802,1) + 8°7822,0 + 8°°(803,2 + 823,0 — 802,3)] 


2 
— [g!°go0,1+8'! gi1,0+g!*(g02,1 + 812,0 — 801,2)+8 (803.1 + 813,0 — 01,3) | 

2 
= [27° 00,2+87! (go1,2 + 821,0 — 802,1)+8"822,0+87*(803,2 + 823,0 — 802,3) | 


2 
— [¢°°g00,3+8°! (01,3 + 31,0 — 803, +8°"(802,3 + 832,0 — 803,2) +8 83,0] - 
(5.3) 


Now we should substitute all entries of (5.3) with double upper indices to (3.32). 
For instance, the entry g!' in the fourth line of (5.3) could be rewritten by means of 
the Sarrus rule for 3 x 3 symmetric algebraic adjoints gj, by 


Wb 8i1 
det(gyv) 
800822833 + 2802803823 — go0(g23)” — g22(g03)” — g33(go2)” (5.4) 
eres, (—1)58"7 2009 81v1 8228303 , 
where the sum in the denominator contains 4! = 24 terms. Note that the optimal 


expression for the minimum number of arithmetic operations to calculate the inverse 
of a 4 x 4 matrix is not known, yet. The other nine entries g9, g°!, g%, 29, gl?, 
g!3, g*?, 273, and g* can be expressed similarly. 
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However, we have to evaluate also the first derivatives of g””. Consider for 
instance the entry at in the second line of (5.3). Then by (5.4) we get 


el! =—( Si ) 
1 Ax \det(gyv) 


1 
= 800822833 + 2802803823 
( eer (= 1)88°* go) 814 8207 833 ( 


— g00(g23)” — g22(go3)” — 833 (en) 
ll 


=| (¢00.1822833 + 2802,1203823 — g00,1(23)” — 922,1(803)* — 933,1(802)" 
+ 800822,1933 + 2202803,1823 + 800822833,1 + 2202803823,1 — 2800823,1 


= 2622803,1 = 2833802.1)( (=1)*8"* go.» 8101 82028303) 


WES, 
— (g00822833 + 2802803823 — 00(823)” — 822(g03)” — 933(802)°) 


x a (—1)8"* (g0v9,181v; 82128303 + 80v98111,182028303 
TES 


+ 80v9 81 vy 82v2,183v3 + 8081», 8228303.1) | 


—2 
x (OED gow sin 82283) (5.5) 


wES4 


Remark 5.1 Substituting all g“” and also their first derivatives into (5.3), we get the 
explicit form of the first Einstein equation Rog = O of the second order for the 10 
unknowns goo, 201, £02,---, 233- It is evident that such an equation is extremely 
complicated. Relation (5.2) takes only 5 lines, relation (5.3) takes more than 50 lines 
and after substitution of all entries with determinants given by (5.4), (5.5), etc., into 
(5.3), the Eq. (5.1) for the component Roo of the Ricci tensor will occupy more than 
10 pages. The other nine equations R,,, = 0 can be expressed similarly. 


Remark 5.2 The explicit expression of the left-hand side of (3.29) for a given 
covariant divergence-free tensor T,,, 4 0 in terms of the unknown components of 
8uv is even more complicated. Using (3.30) and (5.3)-(5.5), we still have to express 
the term —5 Rep similarly. As of 2022, nobody has calculated how many terms 
Einstein’s equations really contain, in general. 
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Einstein’s equations are nonlinear in the first derivatives of the metric tensor, 
since the Ricci tensor (3.4) contains a number of multiplications of two Christoffel 
symbols. From (3.4) it seems that these equations are linear in the second derivatives 
of the metric tensor. Now we prove that this is not true. 


Theorem 5.1 Einstein’s equations (3.52) are not linear in the second derivatives 
of the metric tensor. 


Proof Obviously, it is enough to deal only with the Ricci tensor defined by (3.4). 
From the third line of (5.3) we observe that the second derivative go1,91 is multiplied 
by the entry g!! of the inverse metric tensor. From (5.4) we find that this entry 
depends highly nonlinearly on the metric tensor. Moreover, the second derivative 
801,01 appears in (5.3) only once, so it cannot be canceled by another term. 

Similar considerations can be made if Einstein’s equations are written in mixed 
or contravariant from. Hence, we again cannot avoid to include the entries of the 
inverse metric tensor, since 


RY = gl Ruy, REY = gh Ry, 


The proof is completed. Oo 


5.2 Difficulties with Initial and Boundary Conditions 


Einstein’s equations (3.29) without initial and boundary conditions may have more 
different solutions (see e.g. Theorem 3.10). Since (3.29) represents a system of 
hyperbolic equations with second order time derivatives, two initial conditions 
should be prescribed. In particular, one should prescribe initial conditions for all 10 
components g,,) and all their 40 first derivatives. However, this is almost impossible 
in general case. 

There are similar large problems with boundary conditions and also with the 
divergence-free right-hand side (3.31) of Einstein’s equations (3.29). Moreover, Ein- 
stein’s equations are highly nonlinear. Therefore, the existence and uniqueness of a 
weak solution of Einstein’s equations with some initial and boundary conditions is a 
serious problem. Note, for example, that the one-dimensional steady-state nonlinear 
heat conduction equations with any prescribed type of boundary conditions does 
not have a unique weak solution even when the heat conduction coefficient depends 
continuously on the temperature, see [132, p. 182]. 

It is usually difficult, if not impossible, to prescribe explicitly any appropriate 
initial and boundary conditions for non-spherically symmetric regions for gyy 
satisfying (3.29). The reason is that matter tells spacetime how to curve and 
spacetime tells matter how to move [249]. This is the so-called back reaction. Hence, 
the initial space manifold is a priori not known except for some special cases when 
an analytical solution of Einstein’s equations is known. More precisely, the left-hand 
side of (3.29) corresponds to geometry and the Einstein tensor G,,, tells spacetime 
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how to curve. The right-hand side of (3.29) corresponds to physics and the tensor 
T,v tells matter how to move. 


Remark 5.3 Another non-negligible problem lies in the nonuniqueness of topology. 
The reason is that the knowledge of the metric tensor g,, does not determine 
uniquely the topology and shape of the associated space-time manifold. For 
instance, Euclidean space E* has obviously the same metric tensor Suv = Sus 
i, v = 1, 2, 3, as the infinite cylinder S! x E?, but a different topology and shape for 
the time-independent case with 7;,, = 0 in (3.29). Here S! stands for the unit circle. 
Another manifold {(x, y) € E*| y = sinx} x E? has a Euclidean geometry, too. 
Hence, having the metric tensor, which satisfies Einstein’s equations, does not mean 
that we know what the corresponding spacetime manifold looks like. Its geometry 
is fully described by the Riemann curvature tensor defined in Sect. 6.5 that has 20 
independent components. Several other examples can be found in [249, p. 725]. 


5.3. Extreme Computational Complexity of Einstein’s 
Equations 


According to (5.3)—(5.5), we observe that Einstein’s equations are highly nonlinear. 
From Remark 5.1 we find that the explicit form of all 10 equations (3.29) will 
occupy more than one hundred pages. For comparison note that the Laplace equation 
Au = 0 has only three terms au / ax? on its left-hand side, i = 1, 2,3, and the 
famous Navier-Stokes equations have 24 terms. 

Let N denote the number of mass bodies. If N = 0, then the simplest solution 
of Einstein’s equations is the Minkowski metric tensor (3.5). If N = 1, then there 
are several other simple solutions of (3.29) that use spherical or axial symmetry of 
one body, e.g. the Schwarzschild solutions (3.6) and (3.16), or the Kerr solution (see 
Remark 3.11 and [249, p. 878]). However, these solutions are local and not global 
solutions (cf. Theorem 3.9). Moreover, for two or more mass bodies no analytical 
solution of (3.29) is known, cf. Fig. 5.1. Thus we have a serious problem to verify 
whether Einstein’s equations describe well the N-body problem with N > 1| (e.g. 
for the Solar system). For instance, Jupiter cannot be replaced by a test particle 
with zero mass as Einstein did for Mercury, compare with Example 4.3. Therefore, 
we are unable to reliably calculate any part of Jupiter’s trajectory from Einstein’s 
equations. 

There are many numerical methods for solving partial differential equations 
such as the finite difference method, the finite volume method, the boundary 
element method, the finite element method (see e.g. [44, 191]), etc. For numerical 
solution of Einstein’s equations we have to include back all arguments of the 
functions gy» = Pot. x) x’, xy, Suv.x = Pale, x! x?, x), and gyv,x. = 
Suv,xa(x°, x!, x”, x3) appearing in (5.3)-(5.5) for all u,v, ,4 = 0,1, 2,3. For 
simplicity, we will not take into account that Einstein’s equations are mutually 
dependent, see Remark 3.9. 
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Fig. 5.1 Schematic illustration of space deformations by two interacting bodies 


Consider, for instance, the following simplest setting of the finite difference 
method. One has to establish a four-dimensional regular space-time mesh, e.g., with 


n* mesh points 


(emt) fori lS 1 De: 
Then the 10 values Bula", ee. x), their 40 = 10x 4 first derivatives and 100 = 
10x (1+2+3-+4) second derivatives (of the Hessian) appearing in (5.3)—-(5.5) have 
to be replaced by finite differences at all mesh points. For instance, the following 
second derivative appearing in the second line of (5.3) can be approximated by the 
standard central difference as 


ON geod 
800,11 0; Xj, Xp HX) 
Ded 23 0 V1 2.23 
po Xe 7) + 800%; » X; —h, xz, x7) 


8o0(x}’, x} +h, xg, x7) — 2goo(x?, x i 
h2 


~w 


where h = n~! is the discretization parameter. The huge system of nonlinear partial 
differential equations described in Sect.5.1 would be then replaced by a much 
larger system of nonlinear algebraic equations for approximate values of the metric 
tensor at all mesh points at a given time level. In particular, at each mesh point 
the corresponding discrete Einstein equations will be much longer than Einstein’s 
equations themselves written explicitly. Hence, if e.g. n ~ 100, then the discrete 
system will occupy millions of pages of extremely complicated and highly nonlinear 
algebraic equations. 

It is well known that explicit numerical methods for solving evolution problems 
are unstable. Therefore, one should apply implicit methods. Nevertheless, up to 
now, we do not know any convergent and stable method that would yield a 
realistic numerical solution of the above system with guaranteed error bounds of 
discretization, iteration, and rounding errors. 
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Remark 5.4 Classical relativistic tests are based on verification of very simple alge- 
braic formulae (see e.g. (2.22), (3.7), and (4.2)) derived by various simplifications 
and approximations of the Schwarzschild solution (3.6) of Einstein’s equations 
(3.29). This very special solution corresponds only to the exterior of one spherically 
symmetric nonrotating body. However, we cannot test good approximation proper- 
ties of Einstein’s equations (3.29) by means of one particular exterior Schwarzschild 
solution. Such an approach could be used only to disprove their good modeling 
properties of reality. Analogously, good modeling properties of the Laplace equation 
Au = 0 cannot be verified by testing some of its trivial linear solution, since there 
exist infinitely many other nontrivial solutions and there are infinitely many other 
equations whose solutions satisfy Au = 0. Similarly, there exist infinitely many 
equations having the Schwarzschild exterior solution (3.6), cf. [167, p. 269]. So 
which equations do we test? 


From Sect. 1.5 we know that no equation of mathematical physics describes 
reality absolutely exactly on any scale. Therefore, each mathematical model has 
only a limited scope of its application. In particular, Einstein’s equations with 
cosmological constant A 4 0 (see (3.52)) also should not be applied to the entire 
universe as is often done, since they are nonlinear and thus not scale invariant 
(cf. Sect. 7.2). Note that the observable universe is at least 15 orders of magnitude 
larger than one astronomical unit. 


Remark 5.5 In case of a curved spacetime it is not possible, in general, to define the 
global coordinates (x, x), x?, x3), 


We should always state precisely in which admissible set we shall look for a 
solution of a particular problem. Let us illustrate it in the following very simple 
example. 


Example 5.1 The equation x* = 1 has no solution on the set of even numbers, it 
has exactly one solution x = | on the set of positive integers, it has two solutions 
x = +1 on the set of integers, and it has four solutions {+1, +i} on the set of 
complex numbers. 


Therefore, for solving problems of mathematical physics various function spaces 
were developed. For instance, the solution of the Poisson equation (see Example 3.8) 
with some boundary conditions is searched for in the Sobolev space H! of square 
integrable functions whose first generalized derivatives are square integrable, too 
(see e.g. [191]). Similarly, the solution of the Navier-Stokes equations is searched 
for in the space of divergence-free vector functions whose components are in H!. 
On the other hand, in most works on the GTR it is not specified in which admissible 
set one should look for a solution, cf. e.g. [107, 249, 270, 272, 332, 337, 369, 371]. 


Remark 5.6 The left-hand side of Einstein’s equations (3.52) contains the second 
derivatives of the metric tensor. However, will such an expression produce reliable 
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results on cosmological distances? If the metric tensor of the physical spacetime can 
be more faithfully described by higher derivatives or another nonlinearity, we get a 
discrepancy of equations (3.52) with reality on cosmological scales. 

Finally let us emphasize that Einstein’s equations are fully deterministic whereas 
the universe does not operate solely gravitationally due to quantum phenomena. 
Their effects can be observed not only on microscopic scales. For instance, we may 
decide whenever we wish to change the trajectory of an asteroid in an arbitrary 
direction by the famous kinetic impactor method. The evolution of the physical 
world is thus quite unstable with respect to initial conditions, since small quantum 
fluctuations in our minds can produce large consequences. Such processes are not 
described by Einstein’s equations. This reminds us of the classical philosophical 
problem of free will versus determinism. 


Remark 5.7 Einstein’s equations do not describe quantum phenomena governed by 
the Pauli exclusion principle which states that two or more identical particles with 
half-integer spins (i.e. fermions) cannot occupy the same quantum state within a 
quantum system simultaneously. The neutron is a spin 1/2 particle, that is, it is a 
fermion. 


The main objections to the practical use of Einstein’s equations for more than 
one axially symmetric body can be summarized as follows: 


Their extreme complexity, see Sect. 5.1. 
The impossibility to prescribe reliable initial conditions, see Sect. 5.2. 
The impossibility to prescribe reliable boundary conditions, see Sect. 5.2. 
The modeling error (see Fig. 1.9) of Einstein’s equations is not known. Is it 
small or large? 
5. The difficulty with the left-hand side of Einstein’s equations which is 
divergence-free, see Remark 5.2. 
6. The metric tensor does not uniquely define the topology of the corresponding 
spacetime manifold, see Remark 5.3. 
7. The extreme numerical computational complexity, see Sect. 5.3. 
8. No significant digit of the cosmological constant is known, see Remark 3.15. 
So what will we substitute for A when solving (3.52)? 
9. Einstein’s equations are fully deterministic whereas the physical world is 
stochastic, see Remark 5.6. 
10. Since the classical relativistic tests are based only on the exterior Schwarzschild 
solution, we cannot guarantee that Einstein’s equations describe the physical 
universe well, especially on very large or very small scales. These tests 
are based on verification of very simple algebraic formulae which are not 
mathematical consequences of Einstein’s equations, see Remark 5.4. 


> eS 


Chapter 6 Mm) 
Friedmann Equation en 


6.1 Nonuniqueness of the Notion Universe 


In this chapter we first derive in detail the Friedmann equation from Einstein’s 
equations applied to the entire universe. Then we point out various drawbacks of 
this approach (see also the next chapter). 

In 1584, Giordano Bruno wrote the treatise [50], where among other things he 
conjectured that the universe is infinite. From that time, opinions of the shape of the 
universe have often changed. Isaac Newton and many others understood the universe 
as Euclidean space E” for n = 3. 

In 1900, however, the German mathematician and physicist Karl Schwarzschild 
proposed (see [320, p.66]) that the universe at any fixed time might be non- 
Euclidean and even finite, i.e. having a finite volume. He envisioned it as a 
three-dimensional manifold. 

Note that non-Euclidean geometry arose in the first half of the nineteenth 
century during many attempts to understand the axiomatic construction of Euclidean 
geometry—especially in proving the independence of Euclid’s fifth postulate of par- 
allels. We include among its founders Carl Friedrich Gauss, Nikolai I. Lobachevskii, 
Janos Bolyai, Bernhard Riemann, Sophus Lie, Felix Klein, and many others. The 
history of the development of non-Euclidean geometries is described in detail in the 
review article [54]. 

By the Copernican principle humans on the Earth are not privileged observers 
of the universe. Thus, we are not at some umbilic point [192, p.530] as in the tip 
of an egg which satisfies the required isotropy only at this special point and its 
antipodal point. According to Einstein’s cosmological principle, our universe on 
each isochrone is homogeneous and isotropic on large scales. Roughly speaking, its 
curvature is constant at any point and in any direction. 

More precisely, homogeneity is the assumed property of the universe that at 
every fixed time instant and on large spatial scales the universe appears the same 
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Fig. 6.1 Fluctuations in temperature ~ 2.726 K of the cosmic microwave background (CMB) 
radiation that corresponds to the redshift z = 1089. This radiation arose when the space was 1090 
times smaller than at present and its mean temperature was almost 3000 K (photo from the Planck 
satellite) 


to all observers, wherever they are. In other words, at any fixed time the translation 
symmetry of the universe is required. This is, in fact, the Copernican principle. 

Similarly, isotropy is the assumed property of the universe in which the universe 
at large spatial scales would seem to an observer at any point to be the same in 
all directions, i.e., for each fixed time the rotational symmetry of the universe is 
required. 

Note that the term “Einstein’s cosmological principle” was first introduced 
by E. A. Milne in 1935, see [247]. Already in 1922, C. Charlier writes that the 
cosmological principle was invented by Einstein, although he did not call it that. 


Remark 6.1 The isotropy is continually verified by astronomers. For instance, it is 
confirmed by the cosmic microwave background radiation (CMB) which possesses 
tiny fluctuations of order 10~4 K from its mean temperature 2.7260 K on the whole 
celestial sphere, see Fig. 6.1. Also the well-known y-ray bursts show almost uniform 
distribution on the celestial sphere. Pictures of the Hubble Deep Field, the Hubble 
Deep Field-South, the Hubble Ultra-Deep Field, the Hubble eXtreme Deep Field, 
Webb’s First Deep Field, etc., illustrate this isotropy on very large cosmological 
scales as well. 


Remark 6.2 In the Hubble test of the homogeneity of the universe, one has to 
measure the apparent magnitude (energy flux f from a given galaxy). By [56] the 
number of observed galaxies in the sky brighter than f should vary as f~>/, see 
also [25, 272], [369, Chapt. 14]. For a modification of this test to y-ray bursts see 
[221]. There is no homogeneity in time due to the existence of the CMB and since 
quasars exist only at cosmological distances and galaxies evolve. 
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Note that isotropy at all points implies homogeneity at all points on the space 
manifold, see e.g. [370]. The converse statement is not true. For instance, a 
crystal of CaCO3 is homogeneous but anisotropic. For the Gddel homogeneous and 
anisotropic universe we refer to [332, p. 384]. 

Note also that the universe on scales of about 100 million light-years consists 
of large-scale structures as giant sheets and long fibers (filaments) of galaxies. For 
example, a fiber consisting of many thousand galaxies with a total length of 1.37 
billion light years was discovered, the so-called Sloan Great Wall. There is also 
the KBC void (roughly spherical) with diameter 2 billion light years and low mass 
density. This indicates that the assumption of homogeneity is not entirely correct. 


Einstein’s equations were not developed for a dynamical evolution of the 
universe. This was done later in 1922 by Alexander Friedmann [108] who derived 
from Einstein equations a nonlinear ordinary differential equation for the expansion 
function a = a(t). He assumed that the universe can be described by an expanding 
three-dimensional sphere 


S} = {(x,y,z,w) € E* |x? +y? +2? +w? =a’} 


which enabled him to avoid boundary conditions. The symbol tf stands for a cosmic 
time which is a measure of time by a physical clock with zero peculiar velocity in the 
absence of matter over-/under-densities (to prevent time dilation due to relativistic 
effects or confusions caused by expansion of the universe). 

The current standard cosmological model of the evolution of our universe is 
based just on the Friedmann equation. However, Friedmann’s description was very 
brief. Therefore, the main objective of this chapter is a detailed derivation of the 
Friedmann equation for an unknown expansion function a = a(t) representing the 
radius of the universe. Note that the expansion function is sometimes also called the 
scaling parameter or expansion factor. A natural question arises whether we can 
apply Einstein’s equations that are tested on scales of the Solar system to the entire 
universe, which is a 15 orders of magnitude larger object than the astronomical unit 
au. 


Remark 6.3 The expansion of our universe is usually described by the hot Big Bang 
model. The term Big Bang was first used by Fred Hoyle in 1949. Nevertheless, 
the idea that the universe could have “zero radius” in the very distant past was 
formulated by Friedmann already in 1922. In the English translation of his article 
[108], footnote 11 states: The time since the creation of the world is the time that 
has flowed from that instant when the space was one point (R = 0) until the present 
State (R = Ro); this time may also be infinite. See the origin of coordinates in 
Fig. 6.2. Georges Lemaitre came to the same conclusion 5 years later, see [218]. 


The term “universe” is used in cosmology with various meanings: true spacetime, 
true space (i.e. a part of spacetime at any fixed time instant), and the observable 
universe, which is seen as a projection on the celestial sphere. These are three 
different objects. Their mathematical models are also three completely different 
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Fig. 6.2. Schematic illustration of three different manifolds that are used in the Big Bang model 
our universe with positive curvature index. For simplicity, the space dimensions are reduced by 
two. Hence, the sphere S3 with radius a = a(t) > 0 at any fixed time instant r is replaced only by 
its great (blue) circle S! for z = w = 0. This is the model of the space (universe) with positive 
constant curvature 1/a. The model of spacetime can be obtained by rotation of the (red) graph of 
the expansion function about the time axis t. The observable universe is marked by the (yellow) 
past light cone whose vertex corresponds to an observer. This light cone is deformed near the 
origin, but we do not know how. Each of these three models has a different center, see Remark 6.6 


manifolds (see Fig.6.2). Thus altogether we have 6 = 3 + 3 meanings of the 
problematic notion “universe” for which the terminology is not fixed yet. The first 
three contain real matter, whereas the other three are only abstract mathematical 
idealizations of reality. 

In accordance with the Einstein cosmological principle, we shall understand by 
the universe a cross-section of spacetime at any fixed time instant, i.e., the universe 
will be an isochrone in spacetime for constant t. For instance, if the curvature is 
positive (for a negative curvature see Theorem 6.5), then the corresponding model 
of the universe is the sphere S3 for some fixed radius a = a(t) > 0, which is a 
three-dimensional manifold in the four-dimensional Euclidean space E+ (cf. Fig. 6.3 
below). The associated model of spacetime in E> has dimension four and the model 
of the observable universe has dimension three (cf. Fig. 6.2). 

All six above-mentioned objects have to be carefully distinguished; otherwise 
we may come to various confusions. For instance, the observable universe is not 
homogeneous, since for different cosmological redshifts z, it has a different mass 
density. Thus, it is an entirely different object than the universe as a space. From a 
similar reason the spacetime is also not homogeneous. Therefore, the expansion of 
the universe is a completely different notion than the expansion of the observable 
universe. 

Albert Einstein is a part of the actual spacetime which is modeled by the red 
manifold in Fig.6.2. However, he is not a part of the current physical universe 
not even of the observable universe modeled by the blue and yellow manifolds, 
respectively. We shall see that the Friedmann equation (6.24) describes the time 
behavior of the blue manifold, whereas all data were taken from the observable 
universe modeled by the yellow manifold. 


6.1 Nonuniqueness of the Notion Universe 111 


Note that some galaxies are seen duplicate via gravitational lenses that corre- 
sponds to two different time instants. This nonuniqueness should be taken into 
account locally. 

In the observable universe we cannot measure angles a, 8, y in some large 
triangle to ascertain the spherical, Euclidean, or hyperbolic geometry of the universe 
by means of their sum a+ 6+ y. Such a triangle has to be considered in the universe 
(space), in which we see only our very close neighborhood (strictly speaking, only 
the one point at which we are situated). This limits our ability to perform such 
measurements. 


Remark 6.4 Let us emphasize that the observable universe cannot be modeled 
by the three-dimensional Euclidean space, since it is a cone whose interior does 
not belong to it. The whole situation is sketched in Fig. 6.2, where two spatial 
dimensions are removed. We see that the observable universe can be modeled by 
a light cone which is deformed towards the origin of the time coordinate. Instead of 
spheres Ss for a = a(t) we may consider only their great circles s! having centers 
on the time axis. 


Remark 6.5 It would be a mistake to believe that the known map of the cosmic 
microwave background (CMB) radiation from Fig. 6.1 shows the entire universe, 
how it looked like 380,000 years after the Big Bang. This map shows only a two- 
dimensional slice of a three-dimensional manifold corresponding to the universe for 
z & 1089 when its radius was z + 1 = 1090 times smaller than at present. This 
radius is the same as the radius of any great circle (e.g. equator) at the time when 
relict photons were emitted. Moreover, we observe everything only on the projection 
on the celestial sphere. For example, the relict radiation produced at that time in our 
neighborhood is not on the map of the cosmic microwave background radiation. We 
also do not see any relict radiation from the places where are to date all 10!* galaxies 
in the observable universe. At each of these galaxies we would observe at present 
completely different maps of the cosmic microwave background fluctuations, see 
[198, p. 277]. So on the Earth, an observer may have an idea about how only a tiny 
part of the early universe looked like. 


Remark 6.6 We often hear that the universe has no center. This is similar to the 
statement that a circle has no center. The circle x? + y! = |, of course, has its center 
even though it does not belong to it. In Fig. 6.2, the centers of the blue circles lie on 
the axis f. Therefore, also the model of the universe S? has its center at the origin 
(0, 0, 0, 0) of coordinates of the space a4 although (0, 0,0,0) ¢ S. On the other 
hand, the Earth is at the center (see the yellow vertex in Fig. 6.2) of the observable 
universe which is finite for any k € {—1,0, 1}. Its horizon can be modeled by 
a two-dimensional sphere. The center of the red model of the expanding universe 
corresponds to the Big Bang at an initial time (see also Fig. 9.10). The Big Bang 
thus happened everywhere. 
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Fig. 6.3 The unit circle on the left illustrates the sphere S! = {(x, y) € E*|x? + y* = 1}. The 
surface of the unit ball on the right is the sphere S* = {(x, y,z) € E3 |x? + y?4+ 22 = 1). It can 
be expressed using polar coordinates as follows: x =r cos¢, y =r sing, and the remaining third 
coordinate will be, by the Pythagorean Theorem, equal to z = 4/1 —r?, where r € [0, 1] and 
@ € [0, 277) 


For every n = 1, 2, ... define an n-dimensional sphere (hypersphere) with radius 
r > Oby 


1) ,2 2 2 
S? = {(tis.<<+;%na1) € E** |xptee+ + xy, =r}. 


Note that an isometry is a continuous mapping, whose inverse exists and is also 
continuous, and which preserves distances on a given manifold. 


Theorem 6.1 Forr < R the sphere S!’ can be isometrically imbedded into Se 


The proof is constructive, but we will not present it, even though it is easy. 
For instance, on the right part of Fig. 6.3 there are two circles corresponding to 
parallels at +30° isometrically imbedded into the two-dimensional sphere S?. Thus, 
each sphere can be isometrically imbedded into a larger sphere if the dimension is 
increased by one. In other words, the sections of the sphere S? by the hyperplanes 
Xn+1 = C, where C is constant and |C| < r, are spheres of dimension n — 1. 


6.2. How to Imagine the Sphere S*? 


In accordance with the original article [108] published by Friedmann in 1922, 
we assume that at any fixed time fr, the universe can be modeled by a three- 
dimensional sphere. From Sect.6.1 we know that Karl Schwarzschild already 
in 1900 conjectured [320] that the universe can be described as a huge three- 
dimensional sphere. Albert Einstein also assumed that the gravitational interaction 
of mass causes a positive curvature and that the universe can be modeled by a three- 
dimensional sphere S3 with unchanging radius a, see [94, p. 152]. This manifold is 
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maximally symmetric and the corresponding group of symmetries is the orthogonal 
group O(4). This expresses the fact that our universe is homogeneous and isotropic 


on 


large scales. If a = 1 we will, for simplicity, omit the subscript a. 
Contrary to e.g. a cylindrical surface, a two-dimensional sphere (see the right 


part of Fig. 6.3) 


S=(@y JB |xe+yt+2=) 


cannot be isometrically unrolled into the Euclidean plane E’. Therefore, all maps of 
the Earth’s surface have to be more or less distorted. Similarly, the curved sphere S* 
cannot be mapped into the flat space E* without any distortion of distances. 


Further, we will present 10 independent ways to imagine the unit sphere S*, see 


[188]: 


1. 


2. 


3. 


The sphere S? is actually a three-dimensional surface of a four-dimensional 
ball, see Fig. 6.3 for its lower-dimensional analogues. 

The second way uses parallel cuts. In the Euclidean space 
with coordinates 


~4. 


consider a point 


x =rsin@cos¢, 
y=rsindé sing, 


z=rcosd, 
w=tVv1—-r?, (6.1) 


where r € [0, 1], 0 € [0, zr], and ¢ € [0, 277). Then we easily find that 


x py tt w* =, 


and thus (x, y, z, w) € S®. For clarity, we now cut! the sphere S* by parallel 
planes w =const. for |w| < 1. This yields two-dimensional spheres with radii 
r = V1 — w®” and centers (0, 0, 0, w). Their union is S3. The points (x, y,z) € 
S? are then expressed in standard spherical coordinates defined by the first three 
equations of (6.1). The angle 9 = 0 corresponds to the North Pole and to the 
degenerated radius r = 0 (cf. Figs. 6.3 and 6.6). 

The third way uses the following hyperspherical coordinates 


x = sin x sind cos¢, 


y = sin x sind sing, 


' Tf we similarly cut the unit sphere S* by parallel planes z = const. for |z| < 1, we get circles (two 
of them are degenerated) with radii r = V1 — z2 and centers (0, 0, z), see the parallels in the right 
part of Fig. 6.3. The union of these parallels (circles) is the sphere S. 
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Z = sin x cos6, 


w =COS x, 


where x,0 € [0, z] and @ € [0, 27). It is easy to see that the equality x24 
y? +27 + w? = 1 holds again. We observe that the hyperspherical coordinates 
are a natural generalization of the standard spherical coordinates for the unit 
sphere S? taking y = 2/2 (cf. the first three lines of (6.1) for r = 1). 

Note that the manifold S* with radius 1 can be divided into infinitely many 
spherical shells with radii sin x for x € [0, 2]. The corresponding surface areas 
are 4zr sin” x. Using the above hyperspherical coordinates, we find that 


3 ane 1 7 2 
vol($3) = ax [ sin? y dy = 2n| x = sin2x) | = 2n?, 
0 
which leads to 
vol(S?) = 27a? (6.2) 


for an arbitrary radius a > 0 (cf. also [94, p. 152]). 

4. The fourth way relies on local orthogonal projections. For simplicity, we first 
investigate the sphere S*. On S* consider a small curved “square neighborhood” 
whose center is at the point (0, 1, 0) and whose sides are parts of great circles 
passing through the poles (0, 0, +1) and the points (+1, 0, 0) on the equator. 
The left part of Fig. 6.4 shows the orthogonal projection of this neighborhood 
to the tangent plane y = 1, e.g. meridians are projected to converging vertical 
arcs. 
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Fig. 6.4 Local orthogonal projection of a part of the sphere S” for n = 2 an = 3 into the n- 
dimensional tangent hyperplane. All broken and solid lines are parts of ellipses whose semimajor 
axes are equal to | 


6.2 
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Similarly, on the sphere S* we may consider a small curved cube, whose 
edges are parts of great circles. The right part of Fig. 6.4 shows the orthogonal 
projection of this cube to the three-dimensional hyperplane tangent to S° at the 
center of the curved cube. 


. The fifth way is the well-known stereographic projection. First, we again 


consider only the sphere S*. We will project from the North Pole N = (0, 0, 1) 
all the remaining points of S* onto the plane tangential to the South Pole 
S = (0,0,—1). Then the meridians of the sphere (see Fig. 6.3 right) are 
mapped to lines passing through S and the parallels are mapped on circles. 
Note that such a stereographic projection was used in the construction of the 
astronomical dial representing the celestial sphere at the Prague Astronomical 
Clock (Horologe), see [187, 194, 203]. This projection is a one-to-one mapping 
from S? \ N onto E?. 

Similarly, for the sphere S* we select two opposite poles N = (0, 0, 0, 1) 
and § = (0,0,0,—1). Then all points from S? \ N are projected from the 
North Pole N into a three-dimensional hyperplane tangent to S? at the South 
Pole S. Great circles passing through N and S will be again mapped on lines 
passing through S, etc. 


. The sixth way uses the Gaussian plane of complex numbers C. Setting 


x=x+iy, z=zt+iweC, 


the sphere 


S? = {(x, y,z,w) € E*| |x)? + lyl? +|zi? + |wl? = 1} 


can be expressed as a manifold resembling the unit circle in complex variables 


S? = {x,z) eC? | |x|? + [z= 1}. 


Moreover, we have |x|* = |x?| and |z|? = |z?|. 


. The seventh way employs the quaternions which is a noncommutative algebraic 


field. Similarly as for the complex plane we can find that the sphere S? is 
isomorphic to the quaternions with norm | (see [333)). 


. The eighth way uses special regular tessellations of S* to imagine this manifold. 


We can tile it with congruent bounded regular “polyhedral” cells. These cells 
are curved, since the manifold is not Euclidean. Edges of these tiles are 
geodesics and their faces consist of families of geodesics. Regular tessellations 
then represent a certain mesh that shows how the manifold is bent. 

Let S* be the circumscribed sphere of the regular tetrahedron, octahedron, 
and icosahedron. Then the radial (i.e. central orthographic) projection from the 
center of S? maps all edges onto S?. In this way we obtain regular triangular 
tessellations of S*, see Fig. 6.5. The sphere S? can be tessellated similarly by 
five curved regular tetrahedra, or 16 curved regular tetrahedra, or 600 curved 
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Fig. 6.5 Regular simplicial tessellations of the sphere S*: the projected tetrahedron, the projected 
octahedron, and the projected icosahedron. The corresponding vertex angles are 120°, 90°, and 
72°, respectively 


regular tetrahedra. Their dihedral angles at edges are again 120°, 90°, and 72°, 
respectively. 

9. The sphere S* can also be very roughly approximated by the three-dimensional 
surfaces of the 6 four-dimensional regular polytopes in E*. They are the 4- 
simplex, the 4-cube, the 4-orthoplex, the 24-cell, the 120-cell, and the 600-cell, 
see [44, p. 281]. 

10. The regular 120-cell whose three-dimensional surface consists of 120 regular 
dodecahedra quite well approximates S* and it can be nicely modeled by soap 
bubbles, see [333]. 


Remark 6.7 Assume again that the universe can be modeled by the sphere S3 with 
fixed a > 0. Then the farthest point from the Earth in space (the so-called antipodal 
point [44]) is located at the intersection of all the great circles of S3 that pass through 
the Earth. Its distance from us is a © 3a, 1.e., it is approximately 3x further than 
the present radius of the space. 


Note that Kurt Gédel worked on a theory of a rotating universe [116]. For any 
odd dimension n = 1,3,5,... the sphere S” may rotate around its center such that 
all points have the same speed |v| > 0 (because S” can be “combed” if n is odd). 
It is not a rotation around an axis as e.g. for n = 2. For n = 3 the corresponding 
centrifugal force could then contribute to the expansion of the universe (as in case 
of a rotating circle for n = 1). 


Remark 6.8 The present total mean density of our universe is about p ~ 
10-26 kg/m?, which can be achieved by having only a few protons per cubic 
meter. If the universe can be modeled by S°, its present radius is usually very 
roughly estimated to be a © cto ~ 107° m, where fo © 13.82 Gyr is the present age 
of the universe according to [290]. Then by (6.2) its total mass would be 


M = 2n7a°p © 2- 10°? kg 
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and during its expansion its radius increases with time, but its topology cannot 
change. So during the Big Bang the topology of the cosmic (cosmological) egg 
should also be Si, If there exists the maximum possible mass density e.g. ~ 
10!8 kg/m? (cf. (12.2) below), then the corresponding initial radius would be 
ay © 2-10!'! m ~ 1.3 au at the beginning. The expansion function a = a(t) can 
also be oscillating with minimum ag. Such a model does not contradict the Principle 
of causality as the Big Bang model does. 

Recall that a white hole is a hypothetical region of spacetime that cannot be 
entered from outside (although energy-matter, light, and information can escape 
from it). It is the reverse of a black hole which can be entered only from outside. 
Some cosmologists compare the Big Bang to a white hole, since it cannot be entered 
from outside because we cannot travel against time. However, such a white hole 
would have the same topology as the entire space, i.e., a different topology than an 
ordinary black hole. 


6.3 Metric Tensor for the Sphere S? 


The metric tensor of spacetime has in general 10 independent entries. Now we show 
that due to the high symmetry of the sphere S3 all the non-diagonal entries of this 
tensor vanish. 

From Sect. 6.2 we know that the unit sphere S° is actually a three-dimensional 
hypersurface in four-dimensional Euclidean space E*. Let P ¢€ S° and let 
P = Pv(r,@,@) whose coordinates satisfy (6.1). Similarly to [296, p.253], the 
components of local coordinate vectors for r < | are 


a 
pir, 0, ¢) = ap Pr 6, 6) = (sin@ cos ¢, sin sing, cosé, —r(1 — r*)~!/*), 
r 
a 
pr, 0, d) = aah 6,¢) = (rcos@cos¢,rcosé@ sing, —r sind, 0), 


p3(r, 8, @) := ahr 6, ¢) = (-rsiné@ sing, r sin@ cos @, 0, 0), 


where the vector p with components pj, p2, p3 lies in the hyperplane tangent at the 
point (x, y, z, w) € S? (cf. Fig. 6.6 for a two-dimensional sphere). 

The covariant coordinates of the symmetric metric tensor corresponding to the 
unit sphere S° are given by the relations 


Sop = Pa* PB for a, B= 1, 2,3, 
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Fig. 6.6 Local coordinate 
orthogonal vectors p; and p2 
in the plane tangent at the 
point (x, y, z) € S? given in 
the standard spherical 
coordinates (r, @, 0), cf. (6.1) 


where - denotes the scalar product. Hence, it follows that 


2 2 
~ 2 2 ind 2 “a _ A =. al 
211 = sin’ 0 cos" @ + sin’ 6 sin* @ + cos” 6 + =1+ = ; 

1-—r2 1-—r2 1-—r2 


822 = r” cos” 6 cos” ot r? cos” 6 sin” ot+ r’ sin? @ =r’, 


833 = r? sin’ 6 sin? @ +r? sin’ 6 cos’ @ = r’ sin” @, 
and the other entries vanish, because pj - p2 = pi - p3 = p2- p3 = O due to the 
symmetry of S*. The metric tensor corresponding to the unit sphere S? is therefore 


given by the diagonal matrix 


1 
0 0 
os 1-—r? oy 
8ij — @) r2 0 ’ l, J = 1,:2,.3, (6.3) 
0 Or*sin’?@ 


where r € [0, 1), 6 € [0, ], and ¢ € [0, 277). Note that the Schwarzschild exterior 
solution (3.6) is spherically symmetric in the spatial coordinates with respect to one 
point only. On the other hand, (6.3) is spherically symmetric at all points and this 
describes the isotropy of the model S*. Note that the metric tensor of S* in the 
hyperspherical coordinates is given in [178, p. 115]. 

The metric tensor associated to the entire spacetime manifold is then also given 
by the diagonal matrix 


1 0 0 0 
ae 
e421)" 7-2. 0 . wie: (6.4) 
0 O  -a?r? 0 
2 
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with the opposite signature than in (6.3) (cf. also (3.5)). The zero index corresponds 
to the time coordinate and the remaining three indices correspond to the spatial 
coordinates. 

In the sequel, the Latin indices will run through the set {1, 2, 3}, while the Greek 
indices through the set {0, 1, 2, 3}. 


Remark 6.9 In the theory of relativity, the metric associated to (6.4) is usually 
written using infinitesimally small quantities as follows: 


dr? 
= 


ds? = cdr? — ao] 5 +r7(d6? + sin? 9 d9”)], (6.5) 
r 


where the time coordinate fulfills 
xo = ct 


and r ¢€ [0,1) is dimensionless. The relation (6.5) of the space-time manifold 
modeling the evolution of the universe for a special expansion function a = a(t) 
first appeared in the article by Howard Percy Robertson [302, p. 826] in 1929. The 
same formula was also published by Arthur Geoffrey Walker [367, p. 121] in 1936. 
Therefore, the relation (6.5) is called the Robertson-Walker metric. Sometimes 
it is also called the Friedmann-Lemaitre-Robertson-Walker metric or just FLRW- 
metric due to the pioneering papers [108] and [218]. Note that Walker does not 
cite Robertson’s article [302], but only some of Robertson’s later works from 
Astrophys. J. 82 (1935), 284-301; 83 (1936), 187-201, 257-271. Consequently, 
the priority belongs to Robertson. 


For the entries of the covariant metric tensor (6.4) we thus have 
so=1, gui =0, gip=—a° (Rij fori, j =1,2,3, (6.6) 


where g;; is the metric tensor (6.3) of the unit sphere S3, ie., the tilde actually 
corresponds only to the expression in brackets in relation (6.5). 

Since the inverse matrix to (6.4) is also diagonal, for the entries of the 
contravariant symmetric metric tensor we have 


sM=1, gM =0, gi =a" (6.7) 
where 
po ade” ta gS 1 e/ =0 for ix j 
re’ r2 sin? 6 : 


r £0, and 6 £0. 
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6.4 Calculation of the Christoffel Symbols 


Entries of the Ricci tensor which appears in Einstein’s equations (3.29) are defined 
by means of the Christoffel symbols (cf. (3.4)). In the covariant form (i.e. only 
with lower indices), the Christoffel symbols of the first kind are by (3.8) defined as 
follows 


1 
Vopy = 7 Sap. + 8ya,pB — SBy,a)s (6.8) 


where the index after the comma indicates, for simplicity, the position number of a 
variable according to which we differentiate. By (3.9) the Christoffel symbols are 
symmetric in the second and third index, 


Popy =Vayp for a, B, y = 0, 1, 2, 3. (6.9) 


Therefore, there are generally only 4 x (1 +2+3-+4) = 40 Christoffel symbols. 
Now let us calculate all of them for the sphere S3. 
From (6.8) and (6.4) we see that 


1 
To00 = = = 0. 
000 = 5 800,0 


Since the off-diagonal entries in (6.4) vanish and the derivative of a constant 
function is zero, we obtain by (6.8) fori = 1, 2, 3, 


1 
Yoio = Too = 7 (80i,0 + 800,i — 10,0) = 0. 


Similarly, we get 


1 
Pio0 = 7 (8i0.0 + 80i,0 — 800,i) = 0. (6.10) 


All of the 7 Christoffel symbols, in which at least two indices are zeros, are thus 
equal to 0. 

Next, assume that just one index is zero. Then, according to (6.6) for i, j = 
1, 2, 3 we get only 6 Christoffel symbols due to the symmetry (6.9), 


1 1 o 
Poi; = = (80i,; + 80,1 — 8ij,0) = —s8ij,0 = 4a, 08i;, 
2, 2 


where 


ApS Se (6.11) 
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is a dimensionless quantity. For the additional 3 x 3 = 9 Christoffel symbols we 
have by (6.6) 


1 1 2 
Vio; =Vijo = 5 (8ii.0 + 20i,j — &j0,i) = 7 8ij,0 = — 44,083), (6.12) 

since we again differentiate with respect to the time variable. 

Finally, assume that all indices are nonzero, i.e., we will differentiate only with 
respect to spatial coordinates. Then by (6.8) and (6.6), we have 

1 ar ts es Pe peas 
Dijk = 5 (Bik + 8ki,j — 8jki) = Bik + 8ki,j — 8jki) = —a°T ijk 

and the number of these remaining Christoffel symbols is 3 x 6 = 18. Hence, we 
have established in total 7 + (6 + 9) + 18 = 40 values of the Christoffel symbols 
(which are generally different). 

Next we shall use Einstein’s summation convention for summation over repeating 
upper and lower index. To get once contravariant and twice covariant Christoffel 


symbol (i.e. with one upper and two lower indices), we use the following definition 
(cf. [249, p. 340]) 


i 
May = 8 Tapy 
with summation over A = 0, 1, 2, 3. In particular, we find that 
Ty = 8 Tay = 8° Tope = Toy, 


since by (6.7) the off-diagonal entries of the contravariant metric tensor vanish and 
g = 1. Similarly for i, j = 1, 2, 3 we have by (6.6) 


I Gy = g'*Tapy = gi T spy = eT py, 
as g'? = 0. Furthermore, from the equality (6.10) we have 
lie=0 
and by (6.12) we obtain 
oj = 5 8"T = —ai§(—aao)ain =, = (6.13) 


where 5, is the Kronecker delta, i.e. 84 = 5%, = 5 = | and oe = Ofori £ j. 
From relations (6.7) it follows that 


ro =9, I =0, Mog =s'*Ta0 =0, TS, =aaofi; (6.14) 


and 
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, ie : ae e 
=e Tae ele = — 58 (-a°T en) =e Tgp Tg (6.15) 


We see that in calculating the Christoffel symbols (6.15), the expansion function a 
is canceled out. 


6.5 Calculation of the Ricci Tensor 


First note that 
a Bin = 8, for i j,k = 1,-2,3; 


where 5, is the Kronecker delta. Using Einstein’s summation convention, we thus 
have 


§', =3 and 50 = 8, (6.16) 


l 


which can be seen by a step-by-step substitution for i, 7 = 1, 2, 3. 
Further recall the definition of the Ricci tensor (3.4) 


Rep = Re = Real eel a oP ker are -™, yr an» (6.17) 
where R” ap is the Riemann curvature tensor defined by 
Y  _py Y A ny a py 
Rup = ep. ~ Paw. + opl ay — Val ga: 
From this and (6.13)—(6.16) we come to 
_ py’ A py j J 
Roo = T09,) — Moy.0 + Foo —Pol on = =P =l'o ly 
xf), Cae 
a 0 a //\a_'! 
a 
= -3(4,0/4),0 — 3(ao/aY = 3, (6.18) 


where ao denotes the second derivative with respect to time variable xo, and 


Y Y 
Roi =1,,, — oy.1 


r oy A py 
Oijv eG yg = tape oy, 


J J o J pk 
mT Pagel a ae Gel yi 
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where we used the fact that T° th =f. a j = 0, because derivatives with respect to 
spatial coordinates vanish due to (6.13). Note that we will not employ the equality 
of Ro; = 0 to derive the Friedmann equation. 

Finally, according to (6.13)-(6.17) for a, 6 = 1, 2,3 we have 


h dh py 
Rj =p, - Pi gek gr ape l Gh a 
0 0 pk 
=Tiy0 + Pye - Pag tly Vox 
mk Te 0 k 0 mk Te 
ae ae Ne a ng =o el al 
=. 2\x 22> 25. 25. 
= Rij + (4.004 + 49) 80j + 348ij — 4 8ij — UO8ij 
= Rij + (aoa + 2a°)8ij = (a.ooa + 2a + 2)8ij, (6.19) 
where the sum of the underlined terms defines the Ricci tensor Rij = 2g); (see 


[369, p. 383]), which has nonzero entries only on the diagonal due to the maximal 
symmetry of the manifold S*, see (6.3). 


Remark 6.10 We see that the Ricci tensor contains no spatial derivatives of the 
metric tensor, although according to (6.17) it is defined by the first derivatives of 
the Christoffel symbols and these are by (6.8) defined by the first derivatives of the 
metric tensor. The reason is the very large symmetry of the sphere S? (see [369]). 


6.6 The Friedmann Equation for the Sphere S3 


Now we derive the Friedmann equation from Einstein’s equations (3.52). For 
simplicity, suppose first that the cosmological constant is A = 0. Then Einstein’s 
equations have the form 


1 8G 
7 R8uv = 4 Ty, (6.20) 


Ruy — 
where the left-hand side is the Einstein tensor, R = g’” Ry, is the Ricci scalar, the 
entries of the metric tensor g,,) are unknown gravitational potentials, and T,,, is the 


tensor of density of energy and momentum. Since 5 is maximally symmetric, all 
the tensors in (6.20) are diagonal and (see [107, p. 176]) 


ig=To=TP =p. Tr =T3=7,=—7, 
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where the mass density p = p(t) and the isotropic pressure p = p(t) are supposed 
to be continuous and independent of the space coordinates and pc is the density 
of energy, see e.g. [222, p. 108], [249, p. 728]. This corresponds to a perfect cosmic 
fluid (with zero viscosity). From this and (3.33) we have 


82G 1 
Ruv = (Tw - =T Suv); (6.21) 
c 2 
where 
T=T9+Ti = pc’ —3p 
is the trace and R = —82GT/c*. The 00-entry of these specially modified 
Einstein’s equations (6.21) takes by (6.18) and (6.6) the form 
ao 8G 1 4nG 3 
3 = (pe? goo(pc> — 3p)) = = (0+ ~t). 
a c 2 c c 


From this and (6.20), we get the linear differential equation of the 2nd order 


a mE (0+ 8), 


=-5 a (6.22) 


where the dot denotes the time derivative. This equation is sometimes called the 
linear Friedmann equation, even though it does not appear in the original paper 
[108] by Friedmann. 

Einstein’s equations (6.20) are nonlinear, in general, due to (6.17). Notice how 
the nonlinearity elegantly disappears in (6.18) for the sphere Sue 

Note that the other three equations Ro; = 87 Ge* (Ti _ xT g0i) = 0 fori = 
1, 2, 3 do not give us any useful relationship for the unknown functions a, p, p. 

Finally, according to (6.19), (6.7), and (6.21), we have 


7 82G 1 
(a,o0a + 2a% + 2)8ij = Rij = oa ( PS8ij z8ii(oe 3p)) 


4nG 4G 7 
= a Bil (p — pc?) = a #°Ri)(P = pe), 


Thus from (6.11) we obtain 


: : ‘ 4 Ga? = 
(aii + 2a” + 2c*)8ij = —Z— (we? — p)aij. 


This is altogether six equations, but the three corresponding to the diagonal entries 
are the same and the remaining three corresponding to non-diagonal entries vanish 
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and lead to the unusable identity 0 = 0. So we obtain only one equation 
adi +247 +2? = 4x Ga*(p = 5). (6.23) 
c 


Hence, together with (6.22), we get two equations for three unknowns a, p, p. As 
the third equation we may consider, e.g. the state equation p = p(p). 

Now multiply Eq. (6.22) by the function a” and subtract it from (6.23). Then we 
find that the second derivative a disappears and 


2 
2 2 >) ) 41 Ga *P) 
2a? + 2c? = 4nGa?(p- 5) += (o + =), 


which after a simple adjustment leads to the sought nonlinear Friedmann equation 
of the first order 


ac? 8xGp 


6.24 
- 3 (6.24) 


which does not contain the pressure p. 

Note that (6.22)—(6.24) are three equations, but only two of them are independent. 
Let us further point out that Lemaitre in [218, p. 52] derived the same equation (6.24) 
without citing Friedmann. 


Remark 6.11 The case of a nonzero cosmological constant A 4 0 can be formally 
rewritten by appropriate transformations (6.25) to the case A = 0. For A ¥ 0 let us 
set 

Act Ac? 


_ d p:= =, 6.25 
ee Pe ae OF) 


P:=p 


i.e., the pressure p can also attain negative values. We see that Pp + pc? = p + pc? 
and that the equation 


1 8G 8G 2 
Ri 5 Rou — Aguy = ae Tuy = na ((p + pc*)U,Uy — pg); 


where U,, = (1, 0,0, 0), changes into the form (6.20) without the cosmological 
constant 


1 82 G — ae) — 
Ruv — 5 R8uv = a (@P + pe“)ULUy — P8yv)- 


Then we obtain like in (6.22) 


a “2p “PY 2 ( 3p | 


126 6 Friedmann Equation 


and instead of the Friedmann equation (6.24) we get 


a’ ; Cc? 8nGp _ 8xGp , Ac? 


= = 6.26 
a ge 3 3 7 3 oy 


a 


The corresponding graph of the expansion function a = a(f) is given in [197, p. 70]. 


6.7 Division by Zero in Cosmological Parameters 


For a given smooth positive expansion function a = a(t) define the Hubble 
parameter by 


a(t) 


H(t) := a (6.27) 


It can be thus expressed as the time derivative of the natural logarithm in the 
following way 


a(t) 
d{ In — 
H®= ( 3” ) _ = 


where ag > O is an arbitrary length constant and thus the argument a(t)/ag is 
dimensionless. 

Since the Friedmann equation (6.26) contains too small terms, we divide it by the 
square H? > 0 as is usually done in the literature on cosmology, i.e., without any 
preliminary warning that we may possibly divide by zero. This may lead to various 
paradoxes (see e.g. Remarks 6.12 and 6.13 below, and [173, 196, 198]). Then we 
get the so-called normalized Friedmann equation 


1 = Qm(t) + Qa(t) + Okt) (6.28) 
for three normalized cosmological parameters, which are defined as follows 


_ 82 Gp(t) _ Ac? _ kc? 
Qm(t) = 3H) > 0, Qa (t) = 3H)’ QxK (t) are 


(6.29) 
where k € {—1,0, 1} is the curvature index (k = 1 in the case of sphere). Saul 
Perlmutter calls the parameter Qh the mass density and Qa the vacuum energy 
density (see [279]). The parameter Q2x is called the curvature parameter in [289] 
(or the curvature density). The parameter Qy is also called the density of dark 
and baryonic matter and Q, the density of dark energy. Note that (6.28) is really 
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> : > 
t 0 t, t 


0 13 


Fig. 6.7 The expansion function for Einstein’s static universe, the cyclic universe (the expansion 
stops at some time f2 > 0 and then starts to shrink), the universe with zero cosmological constant, 
and for the currently proposed expansion of the universe with a positive cosmological constant. 
Here t4 = t denotes the time instant of the origin of the cosmic microwave background radiation 


a differential equation, since the derivative a is hidden in the Hubble parameter 
appearing in (6.29). 


Remark 6.12 The radius a of Einstein’s static universe follows directly from the 
Friedmann equation (6.26). For A > 0 the Einstein static solution (see Fig. 6.7) 


a(t) = Aq! 


solves the Friedmann equation for p = Ac? /(47G). Since a(t) = 0 the mass 
density parameter Qy attains by (6.27) and (6.29) an infinite value, even though 
nothing unusual happens. We should properly say that this parameter is not well 
defined as its denominator is zero. We divide by zero also in the case of the other 
two parameters from (6.29). 


Remark 6.13 By (6.29) we see that 


ath>o0 = Qm(t)> co and Qa(t) > +co for A#0, 


corresponding e.g. to an oscillating universe, which is also called a cyclic universe 
(see Fig. 6.7), where the expansion function repeatedly increases and then decreases 
[348]. The reason is that the derivative a vanishes at the point of maximum. Hence, 
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0 a t 0 i 


Fig. 6.8 Graph of the expansion function a = a(t) for (1) a loitering universe for which there 
exists tf) > 0 such that a(t,) = 0, and (2) a bouncing universe, where a(0) = 0 


when the density of dark energy reaches infinity, the universe starts to shrink which 
is absurd. The names of cosmological parameters are thus confusing. Is there really 
an infinite density of dark and baryonic matter and vacuum dark energy, when a = 
a(t) reaches its extremal values? 

Let us emphasize that the measured density of baryonic matter is always finite. 
This shows that the Friedmann model is strange, since the ratio between dark and 
baryonic matter may attain arbitrarily large values. Cosmological parameters have a 
strange behavior also in other cases [196]. 


Similarly one divides by zero also for the loitering universe (see Fig. 6.8) or in 
the de Sitter solution a(t) = a cosh(ct/a) for o = 0 and a suitable constant a > 0, 
see e.g. [59, 80, 94, 211]. Note that in the latter case the expansion rate of a(t) is 
superluminal for large ¢. 


Remark 6.14 At present there is a large number of papers on the so-called precise 
cosmology that categorically state (see Fig. 6.9): 


The universe is composed of 26.8% of dark matter, 4.9% of baryonic matter, and 68.3% of 
dark energy. 


Rightly they should claim: 


According to the standard cosmological model based on the Friedmann equation, the 
universe could consist of about 27% of dark matter, 5% of baryonic matter, and 68% of 
dark energy. 


It is very important to realize the fundamental difference between these two state- 
ments, since the Friedmann equation (6.28) was derived by excessive extrapolations, 
see e.g. [196, 198, 360, 361]. Therefore, we will now look in more detail as to how 
the above percentages were obtained. 


Remark 6.15. The current values of cosmological parameters are determined, for 
example, by very distant explosions of type Ia supernovae, which are treated as so- 
called standard candles (for details see [279, 299]). Type Ia supernovae have about 
10-15% less luminosity than if the expansion of the universe were to decelerate 
only by gravity. From this it can be concluded that the supernova light spreads into 
a larger volume, and therefore the expansion of the universe is accelerating, i.e., 
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Dark 
Baryonic matter en ergy 


Fig. 6.9 Results of the Planck satellite [289] are interpreted in such a way that our universe 
consists of about 26.8% of dark matter, 4.9% of baryonic matter, and 68.3% of dark energy. 
However, they are based on the normalized Friedmann equation (6.28), which was derived using 
excessive cosmological extrapolations by many orders of magnitude, leading to the relation 
Qy + Qa *& 1. Cosmological parameters are sought so that the solution of (6.28) is as close 
as possible to the measured data. The amount of baryonic matter is estimated by the amount of 
luminous matter [371, p. 74] 


a = a(t) is a strictly convex function. Another explanation is given in [358, 359] by 
Vaclav Vavry¢éuk. 


Remark 6.16 By [22, 305] we should consider the extinction of light by host 
galaxies. The measured luminosity depends considerably on whether the supernova 
lies inside or on the edge of the galaxy, the direction of its rotation axis, etc. Thus, 
type Ia supernovae are standard candles only very roughly. 


To determine the distances of supernovae (see [146], [299, p. 1021]) the follow- 
ing formula from [59, p. 511] derived from the Friedmann equation for the so-called 
luminosity distance is used (see (7.16) and Remark 7.17 below for more details) 


cda+z) . im dz ) 
d Q : 
L (Vi K| b Ja 


= ——! Sin 
ApV/|2k\ +7)7(1 + Qmz) — 2(2+7)Q4 
(6.30) 


Here 


70 
Hoy = H(to) © 70 kms~'!Mpe~! = ————__ s~! = 2.27.107!8 s-! (6.31) 
3.086 - 1019 
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Fig. 6.10 Admissible values 
of the cosmological 
parameters Qy and Qa, 
obtained by three different 
methods: BAO (Baryonic 
Acoustic Oscillations), CMB 
(Cosmic Microwave 
Background), and SNe 
(Supernova explosions). 
However, these methods are 
not independent, because they 
are all based on the 
Friedmann equation (6.26) 
derived by questionable 
cosmological extrapolations 


0.0 020304 06 O08 1.0 


is the current value of the Hubble parameter [339] at time t = to which corresponds 
to the present. We will call it the Hubble-Lemaitre constant, since by the [AU 2018 
Resolution B4 in Vienna, the expansion of the universe should be referred to as the 
Hubble-Lemaitre law. Moreover, Qk = 1 — Qu — Qa by (6.28), where for brevity 
the actual values of the cosmological parameters are denoted by 


QM = Qu (to), Qa = Qato), and LK = Ox (Zo). 


From this, the measured values of the redshift z of absorption spectral lines of 
silicon, and the luminosity of type Ia supernovae we can determine particular values 
of cosmological parameters (6.29) by the x? test [299, p. 1021]. The resulting set of 
admissible values of cosmological parameters is marked by SNe in Fig. 6.10. In fact, 
the questionable Friedmann equation is solved by three different methods [279]. 


Remark 6.17 In 1927, Lemaitre [218, p.56] obtained the value Hp = 625 
kms~! Mpe~! which is much larger than that of (6.31), since he had no exact data. 
In 1929 Hubble [134] got Hy = 500 kms! Mpc™!. 


Remark 6.18 By Remark 7.4 below the second denominator in (6.30) is positive 
for Quy => 0 and Q, < 1. Furthermore, note that Weinberg [369, pp.421, 461] 
presents different relations of the luminosity distance independent of the Friedmann 
equation. 
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Remark 6.19 The sum of the measured values Qy and Q, is approximately equal 
to | (see (6.28) and Fig. 6.9). However, this does not allow us to claim that the true 
space is flat (i.e. infinite Euclidean) as it is often stated at present. Even if the sum 
were to be 


Qm + Qa = 1.0000000000001, 


we would still have a bounded hyperspherical (i.e. elliptic Riemannian) universe that 
can be described by the sphere S3 with an incredibly large radius r. Such a space 
is locally almost Euclidean, but finite. There is a big difference between a bounded 
and unbounded space. Moreover, the sphere 5? has an entirely different topology 
than the flat space E? which is promulgated by cosmologists at present. Testing 
the equality Qy + a = 1 resembles an inconvenient and incorrect computer test 
X = Y in single-precision arithmetic (4 bytes), where X and Y are declared to be 
real (not integer). 


Remark 6.20 The fact that H = H(t) is a decreasing function does not contradict 
the statement that the expansion of the universe could accelerate, i.e., that the second 
derivative @ > 0 on some interval, where a is a strictly convex function. To see 
this, take e.g. a(t) = t? on I = (0,00). Then a(t) = 21, a(t) = 2 > O, and 
H(t) = a(t)/a(t) = 2/t is a decreasing function on /. 


Remark 6.21 There are several serious drawbacks of the cosmological parameters 
defined by (6.29), in particular: 


1. For the Einstein stationary universe we have a = 0, see Fig. 6.7. Hence, H(t) = 
0 for any time instant and the density of dark and baryonic matter Qh) = oo 
which is absurd. 

2. We encounter division by zero as stated in Remark 6.13. 

3. For a parabolic space the first derivative of the expansion function ad = a(t) is 
decreasing, but then the function t b |Qx(f)| is increasing in time. Is not it 
strange? 

4. If the expansion function was first strictly concave and then strictly convex (since 
at present the universe accelerates by the standard cosmological model), then 
a = a(t) is not a monotone function. However, then the curvature parameter 
t +» Qx(t) is also not monotone. This is also strange, since the universe is 
expanding all the time. 

5. Another paradox concerns the Big Bang itself. Since a(t) > oo fort > 0°, 
we find that Qx(t) — 0. However, Qx(tg) = 0 is proposed by cosmologists 
also at the present time fg. Moreover, note that the Big Bang event contradicts 
the principle of causality. 


Remark 6.22 There are also essential problems with predicted and measured values 
of the og cosmological parameter. This parameter describes the effective value of the 
mass density fluctuations on a sphere of radius 8h~! Mpc for h € (0.5, 0.75), where 
h depends on the currently accepted value of the Hubble-Lemaitre constant Ho. For 
a substantial discordance in estimating og see [130]. 
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Remark 6.23 In 2016, the Planck Collaboration stated from CMB data that 

Ho = 66.93 + 0.62 kms~!Mpce™!. 
This present value is based on questionable long-term extrapolations applied to the 
standard cosmological model (i.e. Einstein’s equations), while the Gaia and Hubble 


Space Telescope measurements of Cepheids and RR Lyrae yield a 10% larger value 


Ho = 73.52 = 1.62 kms~!Mpc™!, 


see [301]. We again get a disagreement between observations and the theory based 
on the Friedmann equation. 


6.8 Maximally Symmetric Manifolds 


By Einstein’s cosmological principle our universe at any fixed time seems to be 
isotropic and homogeneous on very large scales. Since its global curvature is not 
definitely established yet, it is usually modeled by one of the following three three- 
dimensional maximally symmetric manifolds: the sphere S*, the Euclidean space 
(3, or the pseudosphere H?’. 


Theorem 6.2 For any dimension n > 1 there exist exactly three maximally 
symmetric manifolds up to scaling, namely S", E”, and Ht". 


For the proof see e.g. [369, Chapt. 13] and [277, p. 721]. The curvature index of 
the above three maximally symmetric manifolds is k = 1,k = 0, andk = —1, 
respectively. 


Remark 6.24 Already in 1900, Karl Schwarzschild [320] speculated that our 
universe could be possibly hyperbolic. In 1924, Alexander Friedmann (1888-1925) 
published another famous paper [109], where a negative curvature index k = —1 is 
considered, cf. (6.29). He suggests that only negative mass density is possible for 
a Static universe (see [109, p.2006]) and thus it is not clear how to satisfy such a 
paradoxical requirement. Let us also point out that Friedmann did not consider the 
case k = 0 corresponding to the three-dimensional Euclidean space E?. 


From Einstein’s equation we can derive (see Remark 6.11) the following ordinary 
differential equation for the expansion function a = a(t) 


8G 2 A 2 2 
ee — 4 = ke. (6.32) 


This is the Friedmann equation for any curvature index k € {—1,0, 1}, A is the 
cosmological constant, and p = p(t) > 0 is the mean mass density. At present it is 
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assumed that (6.32) should be considered only for 
t>T, 
where 


t * 380000 yr (6.33) 


is the time of decoupling of the cosmic microwave background radiation (CMB) 
when matter started to dominate over radiation, see e.g. [97, 290]. It is estimated 
that this decoupling lasted approximately 10,000 years. 


Remark 6.25. Let us emphasize that the Friedmann equation (6.32) was derived 
exactly from Einstein’s equations (3.52) without any approximations for a homoge- 
neous and isotropic universe which is described by maximally symmetric manifolds. 
Therefore, Alan Guth [122, p. 348] calls the Friedmann equations (6.22) and (6.24) 
the Einstein equations. 


1 


Remark 6.26 For n = | the space E! can be isometrically imbedded into H and 
vice versa. Nontrivial hyperbolic geometries thus exist only in dimensions n > 1. 


Now we present further relations between Euclidean space and the pseudosphere 
(see [42, p.3]). 


Theorem 6.3 The space E” can be isometrically imbedded into H"*!. 


Converse imbeddings are not so simple. The hyperbolic plane is an unbounded 
two-dimensional manifold which curves away from itself at every point. It has a 
somewhat counter-intuitive geometry which is difficult to imagine. In 1901, David 
Hilbert proved that the hyperbolic plane H? cannot be isometrically imbedded into 
i> (see [128]). His statement is usually formulated as follows: 


Theorem 6.4 (Hilbert) There is no complete regular two-dimensional manifold of 
negative constant Gaussian curvature imbedded into E?. 


In 1955, Danilo Blanua proved that the hyperbolic plane Hi* can be isometrically 
imbedded into the space 2° (see [34]). Therefore, if we would transform ourselves 
into six-dimensional beings in 06 (with Euclidean distance), we could admire the 
beauty of the symmetry of the pseudosphere Hi, similarly to the way in which as 
three-dimensional beings we enjoy the beautiful symmetry of the sphere S*. For the 
time being it is not known whether the dimension six can be reduced. BlanuSa’s 
assertion was generalized by David Brander [42] as follows: 


Theorem 6.5 For n > 1 the pseudosphere Hl" can be isometrically imbedded into 
;6n—6 


It is also not known whether the exponent 6n — 6 can be reduced. Brander [43] 
however proves that there exists a local isometric imbedding from HI” to E7”—!. The 
manifold S*, resp. HI’, which possibly model our universe at a fixed time, can by 
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previous statements be isometrically imbedded to the Euclidean space E’, resp. E!”, 


1.e. 


Sok, Boz. (6.34) 


Here the symbol — denotes the isometric imbedding. In 12-dimensional Euclidean 
space the distances in H* will be undeformed. The pseudosphere H? is a rather 
exotic object and as 12-dimensional beings we could convince ourselves about its 
maximal symmetry. Visualization and construction of the pseudosphere H? without 
any deformations of distances is therefore extremely difficult. On the other hand, it 
is easier to imagine the manifolds E* and S? which are further candidates for the 
model of the universe and which do not contain any privileged points and directions. 

In [192], we prove the theorem that any n-dimensional manifold in E”*! which 
has the same curvature at any point and any principal direction is necessarily 
the sphere S? or the hyperplane in i"+!_ Since the pseudosphere HI” cannot be 
imbedded into E”+! for n > 1, the assumptions of this theorem are not satisfied for 
the manifold H”. 


Remark 6.27 The manifolds E* and Hi are used to model an infinite isotropic and 
homogeneous universe in the standard cosmological model. However, the case that 
this unbounded universe would have at every point and every time instant almost 
the same temperature, density, pressure, curvature, etc., on large scales would be a 
miracle. This would require an infinite speed of information transfer. Therefore, the 
popular theory of inflation [122] cannot explain such a homogeneity and isotropy of 
unbounded infinite universes. On the other hand, the manifold S3 is always bounded 
(finite) and no inflation is necessary to assume, e.g. if a(t) © a exp(ct/a) for some 
constant a > 0. 


Remark 6.28 In [182, p.9], we present another argument against Euclidean and 
hyperbolic geometry of the physical universe, namely, for the interior of a homoge- 
neous mass ball we find by (3.17)-(3.19) that 


1 
di? = =o dr* + r7d6? +r? sin® 6 dd’, (6.35) 
=—TFr"s 


where rs € [0, 1), 0 € [0, x], and ¢ € [0, 277). This relation is very similar to 


di? = 


== dr? + r7d6? +r? sin* 6 dg” 
for the unit sphere S* with the curvature parameter k = 1 (see [369, p. 403] and also 
(6.3)). Here r € [0, 1) is a dimensionless parameter, 0 € [0, 7], and g € [0, 27). 
We observe that the cases k = 0 and k = —1 do not match (6.35) when s > 0. Thus, 
a homogeneous mass distribution causes a positive curvature globally and the most 
probable model of our universe at any fixed time is See 
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6.9 Regular Tessellations of Hyperbolic Manifolds 


Due to Theorem 6.5, a natural question arises: How can we imagine hyperbolic 
manifolds H”? One way is to tile them with congruent bounded regular polyhedral 
cells. These cells are curved, of course. For simplicity, we shall often omit the term 
“regular” in what follows. First consider n = 2 corresponding to the hyperbolic 
plane. 


Remark 6.29 The following often presented model of a hyperbolic plane 


72 “3, 9 2 2 
AY = {(x1, x2, x3) € EX | xp +49 — x3 = —-]} 


is confusing and thus not suitable for our purpose, since the variables x1, x2, x3 are 
not equivalent and the used Minkowski-like pseudometric is not natural. Another 
often sketched representation of the hyperbolic plane as the graph of a saddle-point 
is also not suitable, since this graph has different curvatures at different points and 
different directions, in general. 


According to Hilbert’s Theorem 6.4, the hyperbolic plane should be deformed 
somehow to get some idea of how its regular tessellations look like. There are at least 
six basic ways to perform such a deformation (see [54]). One way is to use the well- 
known Poincaré disc in which all angles between geodesics in H* are preserved, 
but distances are not preserved. This model comes from a stereographic projection 
of the hyperboloid in E”+! onto the hyperplane x,+, = 0. The hyperbolic plane 
is represented by the interior of the unit circle k in E?. Geodesics are then circular 


arcs that are orthogonal to k, see Figs. 6.11 and 6.12. The corresponding metric is 
defined in [184, p. 273]. 


Peep 


PS 
Z| 


on 


Fig. 6.11 The regular tessellations of the hyperbolic plane by equilateral hyperbolic triangles with 
vertex degree d = 7 andd = 8. All sides on the left (resp. right) picture have the same length in 
the corresponding metric (Picture courtesy of Petr Kirka) 
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Fig. 6.12 The regular tessellations of the hyperbolic plane by hyperbolic squares with vertex 
degree d = 5 andd = 6. Every vertex angle is equal to 72° and 60°, respectively, and all sides 
have the same length in the corresponding metric. (Picture courtesy of Petr Kirka) 


Theorem 6.6 There exist infinitely many different tessellations by equilateral 
hyperbolic triangles of the hyperbolic plane TH’. In particular, for every d > 7 
there exists one equilateral hyperbolic triangle with angles 21/d that tessellates 
the hyperbolic plane. 


The cases d = 7 and d = 8 are illustrated in Fig. 6.11. 

There are also infinitely many different regular tessellations of the hyperbolic 
plane Hi? by other curved hyperbolic regular polygons, see e.g. Fig.6.12. In 
particular, in addition to Theorem 6.6 we can tessellate Hi? by: 


¢ by hyperbolic squares with d > 5 (see Fig. 6.12), 

¢ by hyperbolic pentagons with d > 4, 

¢ by hyperbolic hexagons with d > 4, and 

¢ by hyperbolic heptagons, octagons, etc., with d > 3. 


In all these cases, d is also the number of hyperbolic polygons meeting at each 
vertex. 

Concerning the hyperbolic space H’, it can only be tessellated by several types 
of curved congruent regular polyhedra up to scaling, translation, mirroring, and 
rotation. We call them regular tessellations. They were found by Harold Scott 
MacDonald Coxeter. 


Theorem 6.7 (Coxeter) There are only four regular tessellations of H°, namely, 


by a hyperbolic cube, by one of the two types of hyperbolic dodecahedra, and by a 
hyperbolic icosahedron. 


For the proof see [72]. For the corresponding space-fillers see Figs. 6.13 and 6.14. 
The next theorem is proved in [44, p. 167]. 
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Fig. 6.13 A schematic illustration of the cubic space-filler cell of H? whose dihedral angles are 
72° (From Wolfram MathWorld) 


Fig. 6.14 Schematic illustrations of hyperbolic dodecahedral and icosahedral space-filler cells of 
HH? (From Wolfram MathWorld) 


Theorem 6.8 There are no regular simplicial tessellations of E” and Hl” forn > 3. 
Remark 6.30 A modified version of the hyperspherical coordinates from Sect. 6.2 
is sometimes used to express the unit hyperbolic pseudosphere Hl’, in particular, 

x = sinh x sin@ cos @, 

y =sinhx sind sing, 

z = sinh x cos@, 


w =cosh x, 
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where x € [0, 00), 6 © [0, 7], and @ € [0, 277). However, this expression is again 
not a very intuitive model of our isotropic universe at a fixed time, since 


e+y42—y?=-1 
and these variables are not equivalent. In different points and different directions 


the corresponding curvature is not the same. The use of the Minkowski-like 
pseudometric is not natural in this case, because w is not a time variable. 


Chapter 7 ®) 
Excessive Extrapolations From om 
the Friedmann Equation 


7.1 Typical Examples 


The contemporary cosmological model which is based on the Friedmann equation 
(6.28) does not give acceptable results. For instance, it admits a division by zero, 
see Remark 6.13 and Figs. 6.7 and 6.8. It also leads to a wide range of paradoxes 
and has other serious problems, in particular, the problem of the existence of an 
invisible exotic dark matter and dark energy (see Sect. 1.1). The main reason is 
that physical reality is identified with a very simple mathematical model (see (7.21) 
below) ignoring the modeling error which can be quite large. 

Now let us look at these facts in detail. In this section, we present several 
illustrative examples that show why we should be very careful in making diverse 
unsubstantiated and unjustified extrapolations in cosmology. 


Example 7.1 Applying Newtonian theory to the entire homogeneous and isotropic 
universe leads to the well-known Jeans paradox. The corresponding Newtonian 
potential @ = const. (see [141]) should satisfy the Poisson equation 


Ad = 41Gop, 


where A = 07/dx* + 0*/dy* + 47/0z? is the standard Laplace operator and p > 0 
is a constant mass density. However, we immediately see that the left-hand side is 
zero while the right-hand side is positive. Einstein knew about this paradox. That is 
why he was inspired by the equation Ag — Ag = 42 Gp which has a non-constant 
solution for A > 0. In this way he arrived at equation (3.52). 


Example 7.2 Consider a unit cube with edge e = 1 m (see Fig. 7.1) and solve a 


steady-state heat conduction problem 
—Au=f 
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 139 
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Fig. 7.1 A homogeneous 
isotropic cube with edge of 
length e 


with some boundary conditions, where u is the temperature and f is proportional 
to the density of heat forces. This elliptic problem approximates very well the 
true temperature in homogeneous isotropic solids which can be verified by direct 
measurements [191]. However, in applying the heat equation on the atomic level in 
the cube with edge e = 107! m, we get nonsensical numbers, since it is not clear 
how to define the temperature on such a small scale. 


Example 7.3 We also obtain meaningless numbers for e = 10!° m. Such a large 
cube from Fig. 7.1 with density p ~ 1000 kg/m? would immediately collapse into 
a black hole, since the diameter /3e of this cube is about ten times larger than the 
diameter 1.4 - 10° m of the Sun. We can, of course, solve the above steady-state 
heat conduction problem on an arbitrarily large cube. However, the question is for 
which e do we still get acceptable results, and when the resulting temperatures have 
nothing to do with reality. 


Other equations of mathematical physics (such as supraconductivity equations, 
Navier-Stokes equations for fluids, Korteweg-de Vries equations, linear elasticity 
equations, Maxwell’s equations, semiconductor equations, magneto-hydro-dynamic 
equations, and so on) are subjected to analogous restrictions. 


Remark 7.1 Time scales cannot be arbitrary, too. For instance, the backward 
integration of the N-body problem can yield reliable results in the Solar system 
on time scales 10° years, whereas on scales 10!° years it produces definitely 
wrong results, since the Solar system did not exist 5 Gyr ago. Therefore, we also 
cannot predict, e.g., how many inhabitants will be on Earth after 1 Myr or to predict 
any reliable weather forecast after 1 yr. Note that any climatological model is much 
more simple than Einstein’s equations (cf. (5.3)). 


7.2 The Origin of Excessive Extrapolations 


In 1922, cosmologists had no idea about the actual size of the physical universe, 
since other galaxies were discovered later by Edwin Powell Hubble. In spite of that 
Alexander Friedmann [108] implicitly assumed that Einstein’s equations describe 
exactly the behavior of gravity on cosmological scales. However, this hidden 
assumption is highly speculative, since Einstein’s equations (more precisely only 
some simple formulae derived from the parametrized post-Newtonian formalism) 
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“are tested” on the scales of astronomical units and excessive extrapolations are 
made to the entire universe that is at least 1,000,000,000,000,000 = 10!> times 
larger than | au. 


Remark 7.2. The well-known relationship s = vt from Newtonian mechanics is 
scale invariant in Euclidean space. However, it is not scale invariant in the physical 
universe, since any velocity v never exceeds c. Einstein’s equations also cannot be 
scale invariant on an expanding sphere ae since it is bounded. Furthermore, they 
do not trustworthily describe phenomena at the atomic level which also implies 
that they are not scale invariant, cf. Remark 1.12. In this case, the modeling error 
is very large (see the right part of Fig. 7.3). However, it is tacitly assumed that 
Einstein’s equations describe reality very well on arbitrarily large scale. Einstein’s 
equations also cannot be used to model an extreme state just after the Big Bang 
[122] when matter and antimatter collided, highly energetic photons appeared, and 
the temperature thus briefly increased [37, p. 113]. Additionally, they contain fixed 
fundamental physical constants c and G, are fully deterministic and nonlinear. 


Example 7.4 To see why nonlinearities break scale invariance, consider a simple 
nonlinear relation between the magnetic intensity H and the magnetic induction B 
as sketched in Fig. 7.2. This represents a typical behavior of ferromagnetic materials 
contained in electromagnets, transformers, rotary machines, electron microscopes, 
etc., see [191, p. 133] for details. We observe that this relation is linear on very small 
scales near the origin. On large scales it is also linear, but the slope is different. 
Hence, this relation cannot be scale invariant globally. Similar considerations can 
also be done for other kinds of nonlinearities describing, e.g., elasto-plastic material 
(with hardening), nonlinear heat conduction, or Einstein’s equations. 


Remark 7.3 We present one more argument indicating why Einstein’s equations 
should not be applied to the entire universe. The present state of our universe 
depends on its history, i.e., its evolution is surely not reversible. On the other hand, 


|| Bl [7] 


|| Z| [A/m] 


10000 20000 30000 


Fig. 7.2. A typical nonlinear dependence of magnetic induction B on magnetic intensity H 
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Fig. 7.3. Left: A schematic illustration of a general behavior of the relative modeling error E 
for equations of mathematical physics. The horizontal axis has a logarithmic scale and p is the 
exponent for which the modeling error is the smallest. Right: Behavior of the modeling error for 
Einstein’s equations proposed by the current standard cosmological model 


the Friedmann equation is completely reversible. That is its solution does not depend 
on the history, but only on one prescribed value of the expansion function at some 
given time instant (e.g. the present time), since it is an ordinary differential equation 
of the first order, see (7.21) below. 


From Fig. 1.9 we know that no equation of mathematical physics describes reality 
absolutely exactly on any scale. The reason is that the laws of physics are not 
unchanged under a change of scale, in general. Each mathematical model has, in 
general, a different behavior of the modeling error. In the left part of Fig. 7.3, we 
see a typical behavior of this error for different scales. Therefore, it has no sense 
to look for a model that would perfectly describe reality on any scale (micro and 
macro together). 

Therefore, we have to study models that are independent of the Friedmann 
equation (see e.g. [183-185], [271, p.299]) to check whether dark matter exists 
or not. If dark matter does not exist, then the relation (6.30) derived from the 
Friedmann equation can hardly yield some reliable distances. We will return to 
this important problem in Sects. 7.3 and 7.4. The standard cosmological model thus 
resembles the situation of article with the title Lemma 1, see [151], which can be 
characterized as follows: 


Suppose that Lemma 1 implies Lemma 2, from which we further derive Lemma 3. These 
auxiliary results imply a certain important mathematical theorem, which is a basis of a new 
fascinating and beautiful theory. But after some time we find that Lemma 1 is wrong, and 
therefore the theorem need not hold. 


In contemporary cosmology such an assertion as Lemma | is the statement that 
Einstein’s equations describe the evolution of the entire universe very well. This 
hidden assumption is considered to be obvious. However, it is just a null hypothesis 
which has not been verified by any experiment and was obtained by excessive 
extrapolations by many orders of magnitude in the articles [94] by Einstein and 
[108] by Friedmann. Since the twenties, this fatal mistake stretches throughout the 
entire cosmology. Therefore, from a similar reason it also makes no sense to look 
for a theory of everything. 
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7.3 Distances in Time 


In this section, we derive a relation between the redshift z and the so-called look- 
back time (see (7.9) below) by means of the Friedmann equation, i.e., how much 
time has passed before the photons from the observed object reached us. We shall 
follow the idea of Carroll et al. [59] from 1992 in more detail. 

For z > 0 we introduce an auxiliary function 


a(t) _ 1 


ae eee 


(7.1) 


where fo is the age of the universe. By the law of conservation of mass and energy 
we find for zero pressure that 


p(t)a>(t) = p(to)a> (to) 2) 


is a time independent constant for nonrelativistic matter in matter dominated 
universe with z < 1000. Thus we have 


p(tyar(t)  p(t)aX(t) _ plto)a*(to) _ plto)a(to) _ p(to) 
ar(to) — ar(toa(t)—a(t)at) at) X(t) 


From this and the Friedmann equation (6.32) divided by a’ (to), we get 


82Gpa2 Ac? X?(t) ke? 8Gp(to) | Ac*X*(t) ke? 


ae 
ia 3a?(to) * 3 a2(to.) 3X(t) 3 a2(to) 


Dividing this equation by the Hubble-Lemaitre constant 


A agg i! 
= ° ~ a(t) lt=t0 
we obtain 
1. Qm 
ij ay Gl Oe, 73 
BE O = Fey t MAO + Me (7.3) 


where for brevity 


QM = Qu(to), Ln =La(to), and QxK = Lx (to) (7.4) 
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denote the present values of the cosmological parameters (6.29). The left-hand side 
of (7.3) is nonnegative and thus 


1. 
paar et fe 2 
ne | 2 + Q, X*(t) + Qg. (7.5) 
Since by (7.1), 
: d 1 1 dz 
ae) rca 7) = (z+ 1)? dt 


and since the derivative dz/dt is negative, we find from (7.5) that 


ee Se (| es (7.6) 
Bp “Retiza y “MY @+ie 
and thus, 
dz 


= —dt. (7.7) 
Ao(z + 1)fQuz + D3 + Q(z + D2 + Qy 


By (6.28) we can exclude Qk from the denominator using the identity 


Qu(z + 1)? + (1 — Qu — Naz + 1? + Qa =(Qurz + (z+ VD? — Qazz +2). 
(7.8) 


Now by integration of (7.7) we get the relation between the redshift z and the look- 
back time t, 


ee: [ dy as 
ee Fede, G+ 1) Ouz + Det De — Gaze +2) 


We see that the actual look-back time essentially depends on the fact whether 
the Friedmann equation describes well the evolution of the universe. Numerical 
relations between the redshift and the corresponding look-back time are given in 
[284] for k = 0. 


Remark 7.4 If 


QM >O0 and Qa, <1, (7.10) 
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then the integral in (7.9) exists and is finite. These additional conditions are 
essential, because then the difference under the square root in (7.9) 


(OmzZ + E+ Y? — M742) > E+ VY? -7E42) =1 


is positive (cf. Remark 7.15 and Example 4.1). 


Remark 7.5 According to [59, p.509] and also [299, Sect. 4.3], the formula (7.9) 
for z — oo gives the age of the universe only by means of the present values of the 
cosmological parameters Quy, &,, and Ho. However, this need not be true, since 
(7.9) was derived under the assumption (7.2) which is not valid for very large z > 
1000. According to [383, Chapt. 2], for ultrarelativistic matter we have p (t)a*(t) = 
const. fort € (0, t) which corresponds to z>1000 in a radiation dominated universe. 
The use of (6.32) for this case has to be done with special care. 


Remark 7.6 In applying the current standard cosmological model various “delicate” 
limits are performed: a > 0,a ~ w,t > 0,t ~ w~,z > w,... (see, for 
instance, [249, 369]). At present it is believed that a(t) — oo fort > oo. By (6.28) 
and (6.29) for k < 0 it follows that 1 > Q(t), and thus } Ac? < H(t) for arbitrary 
time. From this and (6.27) we observe that also the time derivative of the expansion 
function grows beyond all bounds if A is a positive constant. Hence, the derivative 
a(t) > oo for tf > oo in an infinite universe (Euclidean or hyperbolic). 


Remark 7.7 The present measured value of the Hubble-Lemaitre constant is 
Hy © 70 kms~! Mpe7! © 2.27- 107!8 s7!, (7.11) 
where | pe = 3.086 - 10!© m, Quy * 0.3 and Qa ~ 0.7. According to [290], 
to = 13.82 Gyr, 


i.e., this age was derived up to four significant digits by formula (7.9). Nevertheless, 
from such a simple calculation we should not make any definite predictions or 
categorical conclusions about the actual age of the physical universe as is often 
done. The Friedmann equation was derived from Einstein’s equations by excessive 
extrapolations to cosmological scales. Its solution for f > t9 depends only on 
the value of the expansion function at the present time fo and not on the history. 
Since the terminal condition a(fo)/a(to) = Hp is known, we may evaluate a(fo) 
from equation (6.32) and then integrate (6.32) backward. From this we get the 
above value fg = 13.82 Gyr, see [198, p. 274]. Other models that are not based on 
the Friedmann equation may yield a different age of our universe, e.g. an infinite 
age for a(t) = Cie! + C3 and some suitable constants C; > 0, see [184, 
p. 305]. For instance, if C72 = Ho and C3 = 0, then H(t) = Hp is by (6.27) 
constant everywhere. Then no short-time inflation period just after the Big Bang 
is necessary for the curvature index k = 1, because the exchange of information 
about temperature, density, and pressure does not violate the principle of causality. 
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Fig. 7.4 If the expansion 
function a = a(t) were to be 
concave, then its graph would 
be below the tangent line 
through the point (fo, a(fo)). 
Thus for a(to) > 0 by (7.12) 
the age of the universe fo 
would not exceed the Hubble 
age Ty = 13.97 billion years 


a(ty) 


On the other hand, by the inflation theory of Alan Guth [122] the expansion function 
is firstly strictly convex (exponential), then strictly concave, and then strictly convex 
again for increasing time t. This seems to be unnatural. 


Remark 7.8 The Hubble time, sometimes called the Hubble age (see Fig. 7.4), is 
defined by the relation 


1 
To :-= — © 13.97 Gyr © 4.41- 10! s, 
Ho 


where | yr = 31558 149.54 s and the measured value of Hp is given in 7.11. The 
corresponding Hubble distance is then defined by 


dy := cT) = 1.32- 107° m. 


If the expansion function a = a(t) in its definition domain were to be concave 
everywhere (which probably does not correspond to reality) and a(fo) > O, then by 
(6.27) the following would hold for the true age fo of the universe (see Fig. 7.4): 


_ a(tg) _ 1 
= aay Gp 


Remark 7.9 If 0 < z < | then by (7.9) 


~ 13.97 Gyr. (7.12) 


va 
z= f dz = Ho(to — ft), 
0 


i.e., for a small z the look-back time ¢ is independent of Qh, and Q,. Multiplying 
this equation by c and using the classical Hubble law 


v = Aod, 
where v the recession speed of galaxies at distance d, we get 


v= CZ. 
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Hence, the distance can be calculated from the measured redshift z < 1 as follows 
(cf. [301]) 


ae 
Ho 


This relation is a very good approximation for z < 0.1, i.e., for distances not greater 
than 400 Mpc. Notice that the Hubble law predicts recession speeds faster than c of 
very distant galaxies. However, this does not violate STR, since they do not belong 
to our local inertial systems, see [79]. 


Remark 7.10 Tf Qx = Qa = 0 then by (6.28) and (7.9) 


dz —2 Zz 2 ] 
i G+)? — cer Dy 3(1 (+ aaa) = Ho(to — 1). 


From this and z > co we get Ho = 2/(3%0), Le., 


For instance, if z = 8 then (z+ 1)7/? = 27 and the corresponding age of the universe 
would be t = 710. In fact, this represents an upper estimate of the age f for nonzero 
cosmological parameters Qk and Q,. To see this, assume that (7.10) is valid. Then 


(Quz + 1)@+ 1)? — QaZZ+2) < E+), 


and thus by (7.9) 


S an dz 
t ti= 
: Ao Jo (7+ 1) V(Quz + DEF 1)? — QazZ +2) 


> an de =f t 
- Holo @+IP2 


Pas a 
that ist <t = yl. 


Remark 7.11 Using (6.27) and (7.1), we get 


H(t) = 40), 4 


“a aay 
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From this, (7.6), and (7.8) we can express the behavior of the Hubble parameter 
under the assumption (7.10) as a function of the redshift z as follows 


H(z) = (z+ tat +1)+ G+? rey, + OK 


= HoV (Quz + D+ 1? — Qaez +2), 


where H(z) = H(t) and z = a(to)/a(t) — | by (7.1). The inverse relation between 
z and ¢ can be derived from (7.9). 
Remark 7.12 Consider the linear equation of the second order 


. 4G 3p Ac? 
a= (6+ =F )a+ 4 (7.13) 


which is a direct consequence of the Einstein equations, see Remark 6.11. For a 4 0 
define the dimensionless deceleration parameter by 

da H 

. = 1 ’ 
( a)? H2 


q= q(t) := 


where the last equality follows from (6.27). In particular, if the pressure p = 0, then 
q can be expressed by means of the cosmological parameters (6.29) as 


4nGp Ac? 1 
qt) 3H 3H2 = 2 M(t) — Qa(t) 


due to (7.13). For Quy * 0.3 and Q, * 0.7 the present value of the deceleration 
parameter g(fo) *~ —O0.55 is negative, and thus the expansion of the universe is 
accelerating. Thus, the energy of the entire universe is not conserved. 


Remark 7.13 The current standard cosmological model assumes that time flows 
completely uniformly from the Big Bang on (see Fig. 7.4). Therefore, we often hear 
the question: What was before the Big Bang? 

However, it is important to realize that in the observable universe we actually 
look in any direction into the vast spacetime singularity. The more distant the 
observed objects are, the more it seems to us that time passes slower. If there 
were a huge clock placed at z = 1 from the Earth, then we would see that it goes 
twice as slow. The largest currently observed distance corresponds to the CMB with 
redshift z = 1089 which appeared 380,000 years after the Big Bang. At that time the 
universe was 1090 times smaller and the radius of the corresponding CMB sphere 
was about 10*4 m. The most typical diameter of fluctuations in the CMB is about 
one angular degree which is about 107! m in diameter, i.e. greater than our Galaxy. 
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Note that this is too large an object for the well-known method of baryonic acoustic 
oscillations, cf. [97]. 

During one earthly hour we would observe that the CMB clock had moved only 
about 3.3 = pon s. If we would ever see objects more distant than z = 1089 (using 
e.g. detectors of relict neutrinos), we would find that time there passes even more 
slowly, and so on. To avoid this drawback, a conformal time is usually applied, see 
(7.17) below. Thus the Big Bang singularity deforms not only space but also time. 
This is a similar phenomenon to time dilatation in a gravitational potential hole, 
since photons reduce their frequency. 


7.4 Classification of Cosmological Distances in Space 


In cosmology many different distances are defined: angular, comoving, Euclidean, 
Hubble, light-year, luminosity, Minkowski, parallax, proper motion, redshift, 
... distance, see [189]. There is no single natural distance, since the universe is 
expanding, curved, and we look back in time. In this section we will concentrate on 
geometrical interpretations of the above-mentioned distances. 

By [369] the angular distance (sometimes called the angular-size distance) is in 
Euclidean space defined by the relation 


: (7.14) 


where D is a known (or assumed) proper size of an object and 6 € (0, z) is its 
apparent angular size (see e.g. Fig. 1.7, where D = 2r sin(é/2) * ré). Since 6 is 
usually very small, we have tan 6 © sind © 6 in radians. 


Remark 7.14 Let an observer be located at the North Pole of the sphere S? in 
Fig.6.3. The arcs of the parallel +60° between the meridians 0° and 30° are 
obviously viewed under the same angle. A similar nonuniqueness may cause some 
troubles when using the angular distance (7.14) in S*. On the other hand, a very 
long and distant segment in HI* can be viewed under a very small angle. Since the 
actual curvature index k is not known up to now, one has to be careful when using 
the angular distance in cosmology for large z. 


The proper motion distance (sometimes called only the proper distance or the 
comoving transverse distance) is defined by the relation 


dm = a(to) f(x), (7.15) 
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where f depends on the curvature index k as follows 


sinxy ifk=1, 
fOI= 4x  ifk=0, 
sinh x ifk = —1. 
Note that for k = O the proper motion distance dy is the usual Euclidean 


distance. Only in this case the space manifold at any fixed time is not curved. We see 
that dy is always bounded for z — oo, since the maximum possible coordinate x is 
bounded. When k = 1 the formula (7.15) should be used by Remark 7.14 only for 
X < 1/2. For 2/2 < x < ma andk = | the proper motion distance dy is maximal 
for x = 2/2, and dy = 0 for x = mz, ie., the proper motion distance does not 
satisfy the standard mathematical requirements attributed to the distance, see [383]. 

Let L be the bolometric luminosity (i.e. total—integrated over all frequencies) of 
an object. It is measured in Watts. Then the luminosity distance dj, is related with 
the flux £ measured in W/m? by 


a ae Z 
=—,, ie. =,/—: 
Ande LY 470 


The luminosity distance in Mpc can be established from the observed apparent 
magnitude jz and the absolute magnitude j1aps of an object by the formula 


This is a key formula in cosmology [59], since one can use the observed flux to 
establish d_. The main idea is based on the known density of light flux, the distance 
and flux of some reference galaxy, and Pogson’s equation (for details see [184, 
p. 110] and [369]). 

According to [241, Eq. (6)], the luminosity distance is given by the formula 


d, = a(to) f(x) +z). 
Hence, the formula for the flux @ has to be modified as follows (see [369]): 


L 
© ain) FOO + De 


Since photons lose their energy during the expansion of the universe, the factor 
(z + 1) appears in the denominator. Another factor (z + 1) is due to the fact that 
one second lasted longer when a photon was emitted than when it was received. 
According to [369, p. 485], 


di (z) = (2 + Ido. (7.16) 
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Now we will derive the dependence of the proper motion and luminosity 
distances, respectively, on the redshift z. To this end, we will consider the conformal 
time n given by 


pee (7.17) 
Ta) : 


instead of the usual time f (i.e., 7 is linearly proportional to ¢ if a(t) =const.). Then 
from (7.7) and (7.1) we find that 


dz _ _alt)dn _ dn 
a(to)Ho(z + )JQu(z+ D3 +2K(Z+D2+N2, Cato) c(z +1) 


Multiplying both sides by z + 1, we get by (7.8) and integration as in (7.9) that 
dz 


c [ (7.18) 
a(to)Ho Jo V(Quz+ DE+ VY? — QazZ+2) 


X=" -)N= 


This is a dimensionless number that represents the distance in the comoving 
coordinates. The largest possible distance in x is thus Xhorizon = 10. Therefore, 
the observable universe is bounded. 

Note that the conformal time enables us to replace the term c7dt* by a?(t)dn? 
which makes the relation (6.5) more elegant. 


Remark 7.15 For admissible values of cosmological parameters Quy = 0.01, Q, = 
1.1 (cf. Fig.6.10), and z = 3, the difference under the square root in (7.18) is 
negative, because 


(Quz + 1I)(Z+ 1)? — QazZZ+ 2) = 1.03 -42 —1.1-3-5 = —0.02, 


which would yield complex values. In spite of that the standard Ned Wright’s 
Cosmological Calculator [396] gives a quite acceptable value d, = 164.1 Gly. Such 
a value is sometimes also called the light-year distance. 


From now on we shall assume that conditions (7.10) hold. These conditions are 
only sufficient (not necessary) for the existence of the integral in (7.18). 


Remark 7.16 Let k = 0. Then by (7.15), (7.16), and (7.18) we get 


(z+ lc dz 


R= Daor= / , 
L(z) = (z + la(to)x Ho 0 \Omet DE+I ~ Qc FD) 


Remark 7.17 By (7.4) and (6.29) we see that 


kc? 


Qk = —-———__. 
Hea (to) 
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Hence, by (7.15), (7.16), and (7.18) we get for k = 1 that 


dm (z) = a(fo) sin x 


Cc é dz 
ei a O . 
Ao |&2x| sin | «lf Se) 


For k = —1 we only replace sin above by sinh and for k = 0 the factors .//Qx] 
are shortened, see Remark 7.16. These formulae derived in [59] were also used by 
Perlmutter et al. in [280, p. 566]. Moreover, note that the observed flux essentially 
depends on the used measuring instruments. 


Remark 7.18 By [369, p. 485] the angular distance for any k € {—1, 0, 1} is defined 
as follows 


dm (Z) 


da(zZ) = ; 
A@) zt+1 


where the proper motion distance dy is given by (7.15). Hence, by (7.16), 


di (Zz) = du (z) = (z + Ida(z). (7.19) 
z+1 


For k = 0 the above angular distance coincides with (7.14) and on very short 
distances one may also define the standard parallax distance by means of a 
parallactic angle (see e.g. [369, pp. 420 and 485]). 

From (7.15) we see that d,(z) is unbounded for z + oo, dy(z) remains bounded 
for z — ov, and da(z) converges to zero for z — oo. Here one has to realize that 
the light of distant objects with redshift z > 0 propagates into a larger volume than 
if the universe were not to expand. This gives the factor (z + 1)~! on the left-hand 
side of (7.19). A valid criticism of formula (7.19) is given in [363]. 

Moreover, we must consistently distinguish between “at that time” and “actual 
dimensions”, i.e. the size of the universe when observed photons left a distant 
object and the universe was much smaller, and “today’s” dimensions when “ancient” 
photons arrived at our ground-based telescopes. Roughly speaking, the younger the 
objects which are observed, the larger the magnification appears. Therefore, by 
angular measurements we paradoxically see a very distant object as being larger. 
We call this phenomenon the time lens. The magnification is given by the factor 


(z+ 1), 


which appears on the right-hand side of (7.19). In [184, p. 314], the functioning of 
the time lens is illustrated by three real-world examples. 
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From (7.19) and Remark 7.9 we also see that 
CZ 
dy (z) © dm (Z) © da(z) © —— & c(to — t) 
A 
when z is small, i.e. 0 < z < 1. The corresponding error is only a few percent for 
zS0.1. 
7.5 A Simple Form of the Friedmann Equation 


Throughout this section we shall suppose for simplicity that 


A> 0. 


The case A < 0 can be investigated similarly, see e.g. [348]. The inequality A < 0 
leads to an oscillating universe, while for A = 0 the behavior of the expansion 
function depends on the so-called critical density p = oie /(82G) © 10776 kg/m3, 
see e.g. [184, p. 285], [231, p. 75]. 

The time instant t will again correspond to the decoupling of the cosmic 
microwave background (6.33). We shall assume that the expansion function a = 
a(t) satisfies the following condition 


a(t) > 0, (7.20) 


because the universe was expanding for t = T. 

Since the product p(t)a>(t) is constant (see (7.2)) during the time period when 
matter dominates over radiation, the Friedmann equation (6.32) takes the equivalent 
form 


Cc 
@ =Ae +B4— (7.21) 
a 
with time independent constant coefficients 


Ac? 82 Gpa? 
Age B = —kc?, ca 


>0 (7.22) 
which are not exactly known. This simplified notation will be useful in further 
analysis. 

Before we analyze the existence and uniqueness of the solution of (7.21) with 
some initial or terminal condition, we present several remarks, examples, and 
definitions. 
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Remark 7.19 Since the ratio 


a(to) 


a(to) 


= Ho = H(t) © 70 km/(s Mpc) 


for the present time fo is known, we may calculate from (7.21) and (7.22) the 
terminal condition ay = a(to) = cf [Ac2/3 —kHj + 82Gpo/3 ~ 10°° m. 
Thus we can solve equation (7.21) backward and also forward in time for given 
cosmological parameters (7.4). 


Remark 7.20 Since the expansion function a = a(t) > O is continuous, its time 
derivative a = a(t) is by Eq.(7.21) also a continuous function. Anyway, from 
such a simple ordinary differential equation we should not make any definitive 
conclusions about the deep past and the distant future of the universe as is often 
done. Equation (7.21) does not contain the term corresponding to radiation. The 
reason is that the present measured value of relative density of radiation is less than 
~ 5-107, compare with Fig. 6.9. However, fort < T, 1.e., for the time period when 
radiation dominated over matter (see Remark 7.5), another term D/ a’ is added to 
the right-hand side of Eq. (7.21). 


Remark 7.21 Vf a > 0 is sufficiently small, then C/a is much larger than the terms 
Aa’ and B on the right-hand side of (7.21). Hence, da ~ C and the corresponding 
density parameter QQ), ~ 1 also dominates the other parameters from (6.29). In 
this case the expansion function behaves as t7/> up to some multiplicative positive 
constant (as in the Einstein-de Sitter parabolic model with Qy = 1, Qa = 0, and 
Qx = 0). 


A function f: E! — E! is said to be Lipschitz continuous if there exists a 


constant L such that 


[f (x1) — f (x2)| < Llx1 —x2| Var, x2 € El. 


a 


Theorem 7.1 Let f: E! — E! be a Lipschitz continuous function and let yo € E 
be given. Then the problem 


y=fQ), y@O)=yo 


has exactly one continuously differentiable solution on E!. 


For the proof see e.g. [68, Chapt. 1] and [297]. The next two examples illustrate 
that if f is not Lipschitz continuous, then the global existence and the uniqueness 
of the solution in Theorem 7.1 are not guaranteed. 


Example 7.5 The function y(t) = tan y fort € [0, 2/2) solves the Riccati equation 
y = y? + 1 whose right-hand side is not Lipschitz continuous for y € (—00, 00). 
Thus we see that the existence of a global solution over [0,00) need not be 
guaranteed for right-hand sides that are not Lipschitz continuous. 
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Example 7.6 The equation y = ,/y with the initial condition y(0) = O has two 
solutions y(t) = qt? and y = 0. The right-hand side ,/y is not Lipschitz continuous 
at 0. 


Now we shall investigate the solvability of the Friedmann equation (7.21). Since 
this equation is autonomous, it does not matter in which point the initial condition 
is prescribed. 


Theorem 7.2 [f the curvature index k < 0 and (7.20) holds, then (7.21) with an 
arbitrary initial condition a(t) > O has exactly one continuously differentiable 
solution which is increasing over the interval [t, o&). 


Proof The right-hand side of (7.21) 
2 C 
F(a) = Aa“ +B+—, ae(0,ov), (7.23) 
a 
is a strictly convex function, since its second derivative F”(a) = 2A + 2C/a? is 


positive for a > 0. From the equation 


F'(a) = 2Aa — = =0 (7.24) 
da 


we find that F attains its unique minimum at the point 


3) C | 3C 
anin = 5A ae > 0. (7.25) 


From (7.22) we see that the right-hand side of (7.21) is positive for k < 0 and, 
therefore, a does not change its sign. By Theorem 7.1 the expansion function a = 
a(t) exists over the whole interval [t, 00) and is continuously differentiable there, 
since by (7.20) the lower bound a = a(t) > 0 holds for all t > t and the following 


linear upper bound 
: Cc 
a=,/Aa2+B+— <2VAa 
a 


holds for sufficiently large a. oO 


Let us still note that the function b(t) = exp(2V/ At) is the solution of the upper 
bound equation b = 2./Ab and thus 0 < a(t) < b(t) fort > ow. 

By Theorem 7.1 solutions a = a(t) of (7.21) for various initial conditions 
a(t) > O do not intersect and they also do not bifurcate, since the right-hand side 
of the equation a4 = ./F (a) is a Lipschitz continuous. In the next theorem, we will 
investigate the case k = | which is more complicated and quite technical. 
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Theorem 7.3 If the curvature index k = | and (7.20) holds, then for the solution 
of (7.21) with an arbitrary initial condition a(t) > 0 there exists at most one point 
ta € [t, co) such that a(t2) = 0. 


Proof Let F and din be defined again by (7.23) and (7.25), respectively. For k = 1 
consider the following three cases: 


1. Let F(amin) > 0. Then by (7.20) and (7.21) we have a(t) > /F (amin) > 0 
for all t > t, where Tt is given by (6.33). Therefore, the expansion function is 
again increasing over the whole interval [t, 00) due to the following linear upper 
bound @ < 2¥‘Aa for sufficiently large a. Note that this important case is often 
forgotten in the cosmological literature. 

2. Let F(amin) = 0. Then F(a) > O for all a ~ dmin and by (7.21) the second 
derivative is 


2Aa — C/a? 


C 
a= a= +(Aa-——,). (7.26) 
2,/Aa2+B+C/a ( 5a) 


Comparing this equation with (7.13), we observe that the sign + should only be 
considered. Moreover, we see that its value for ad = din iS 


Age = = 9-1/3 42/3C1/3 _ 9-1/3 42/3 01/3 — 9. (7.27) 
min 
If a(t) > admin then 
Cc 
a(t) > Admin — a = 0 


min 


and a = a(t) is strictly convex and increasing over the interval [t, 00) due to 
(7.21), see Fig. 7.5. 


7 f 


Fig. 7.5 Eddington-Lemaitre solutions a = a(t) of the Friedmann equation (7.21) for the case 
k = 1 and F(admin) = 0. Moreover, if (7.20) holds, then all solutions are increasing. For 0 < 
a(t) < Gmin the corresponding solution is strictly concave and for a(t) > dmin it is strictly 
convex. The constant Einstein solution a(t) = amin does not satisfy (7.20) 
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If a(t) = amin then a = a(t) is a constant function everywhere due to (7.21), 
(7.26), and (7.27) which contradicts (7.20). Hence, this case cannot happen. It 
corresponds to the Einstein static universe, which will be treated in detail in 
Theorem 7.4. 

If the initial condition satisfies the inequalities 0 < a(t) < amin, then 


Cc 
< Admin = =0 


2a2(t) 


and a = a(t) is strictly concave and increasing for all t € [t, 00). The sharp 
inequality in the above relation follows from the uniqueness, since no solution 
can cross the line a(t) = admin. 

3. Finally, let F (amin) < 0. Then there exist two roots aj < a2 such that F(a,) = 
F (a2) = Oand F is negative on the interval (a), a2). However, the left-hand side 
of (7.21) is nonnegative everywhere. Hence, the case a, < a(t) < a2 fort > T 
cannot happen. 

If a(t) = a, or a(t) = ay, then by (7.21) we have a(t) = 0 which contradicts 
(7.20), i.e., these cases can be excluded, too. 

If the initial condition satisfies a(t) > a2, then F(a) > 0 for all a > az and 
by (7.20) the expansion function is again increasing. 

We will not investigate the case a; < 0, since the radius of S3 is positive. So 
let the initial condition a(t) satisfy the inequalities 0 < a(t) < a,. Then F is 
positive on the interval (0, a;). Set 


C 
K := Aa, — —>. 
2a; 


Since 0 < aj < dmin, we obtain by (7.26) and (7.27) that 


eK Satya S20 
a(t) = Amin a2 


a(t) = Aa(t) — 
for all t > t such that a(t) > 0, i.e. 


a(t)< K <0 


which means that a = a(f) is strictly concave. By integration we find that 


t 
a(t) — a(t) = a(a)do < K(t—T) <0. 


Consequently, Kt — —oo as t > oo and since a(t) and KT are fixed numbers, 
there exists exactly one time instant t2 > t such that 


a(tz) = 0= F(a)). 
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Then the cosmological density parameters are not well defined due to the division 
by zero in (6.29). This case corresponds to a universe of finite duration. The 
unique solution of (7.21) is given by the equation d = ./F (a) on the interval 
[v, t2] and by the equation a = —,/ F(a) after the point f until a = a(t) reaches 
the zero value. 

| 


7.6 Einstein’s Static Universe 


Friedmann described the dynamical behavior of the universe as an alternative to 
Einstein’s static universe. Now we show that the expansion function associated to 
Einstein’s static universe also satisfies the Friedmann equation. Instead of (7.20) we 
shall assume that a = 0 and moreover, let k = 1 and let again 


A>0. (7.28) 


This special case corresponds to Einstein’s static universe from [94]. By (7.23) and 
(7.25) we have 


C 31 C2 [2A 
2 2 4 2: 3 
0 = F(amin) = Adin — € ee ry le +C ral 
AC2 | AC2 
=> (imei a we 


Hence, by (7.22) and (7.28), 
4 2c? 
C=c3},/— = : 7.29 
“V27A~ 3/A ea 


Substituting (7.29) into (7.25), we get the formula for the radius of Einstein’s static 
universe, namely 


oa TR (7.30) 


Notice that this solution does not require any initial and boundary conditions. 
However, Einstein in [94] did not ask a natural question whether stars could shine 
for an infinitely long time. The constant solution of Einstein’s equations (7.30) is 
thus unphysical. 
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Theorem 7.4 The total mass of Einstein’s static universe M and its mean density 
p can be expressed by the following fundamental constants 1, c, G, and A, 


mc? _ Ac? 
2G6VA lO” OG 


Proof By (6.2) the volume of the sphere S} with radius a is 
V = 277°. 
From this, (7.29), and (7.22) we obtain 


2c? 4GoV 4GM 
= 86 Sra Se 
JK A A 


which yields the proposed relation for M. From this and (7.30) we get the formula 
for the density o. This formula can also be obtained from (7.13) for d = O and 
p=O0. oO 


Remark 7.22 The mass of our Galaxy is about 10? kg. For comparison let us 
evaluate the mass of Einstein’s static universe by Theorem 7.4 for A ~ 107°? m7?, 


Yea eT (7.31) 
= EUR : g. : 


Hence, Einstein’s static universe is tuned very finely. However, the resulting 
stationary solution 


a= A~!/? = 10° m (7.32) 


of Eq. (6.32) is not stable, i.e., any small deviation from the constant a = a(t) = 
A—'/? will cause either a gravitational collapse, or expansion (see [249, p. 746] or 
[348, p.405]). We also see that the constant C in (7.29) is independent of the total 
mass of the universe, while in a more general case (7.22) the constant C depends on 
the total mass. Furthermore, we observe that the density parameter Q)y4(t) defined 
in (6.29) is infinity even though nothing dramatic happens. The mean density of 
baryonic matter is of course finite, namely 


M xc Ac? 


= = =~ 10776 kg/m}, 7.33 
Vo 2GVA-2n2a3 4G ae ce 


p= 


This density differs from that stated in Remark 7.27 below due to a special choice 
of A © 107? m7”. 
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Remark 7.23 The numbers appearing in the Friedmann equation (6.32) are very 
different in size in the SI units, e.g., A ~ 10~°? m7’, p(to) © 1077’ kg/m?, G = 
6.674-10-!! m3 kg! s~?, c = 3-108 m/s, and a(to) © 10° m. This might produce 
various numerical troubles in finite arithmetic precision. For k = | the radius a(to) 
is usually estimated as a(to) © c/H(to) © 102° m, but several times larger values 
can also be found in the literature. 


Theorem 7.5 The pressure p = 0 for Einstein’s static universe. 


Proof Since a = 0, we find from the linear Friedmann equation (7.13) and (7.30) 
that 


4a Gp 4nG 4 Ac? V/A 4 cask 0 
az a a= => . 
o ae 3 3 
The proof is completed. Oo 


7.7 Some Consequences of the Friedmann Equations 


Remark 7.24 Now let us calculate the inflection point of the expansion function 
a = a(t), i.e., when the strictly concave expansion function changes into the strictly 
convex function. Note that the inflection point need not exist in the interval [T, 00) 
under some special conditions, see e.g. Fig. 7.5. From (7.21) we find that 


D 2Aa — C/a* 
a= =0 
2/Aa2+B+C/a 
Hence, $ Aca = 2Aa*? = C and thus for the inflection point we obtain 


surprisingly the same relation as (7.25), i.e. the radius of Einstein’s static universe 
(see (7.29) and (7.30)), namely 


which is independent of the curvature index k. From this and (7.22) we get 2Ac? = 
82 Gp which for A ~ 107°? m7? yields the corresponding mass density 


_ Ac? 
~ ArG 


p ~ 10776 kg/m?3, 

Interestingly, its value is again the same as that in (7.33). Since the measured mean 
density in our neighborhood is smaller, the expansion function seems to be strictly 
convex at present. This fact was predicted already in 1931 by Georges Lemaitre 
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(219, p.422] (see also [249, p. 746], [272, 345], [348, p.411]). Moreover, from 
Remark 7.12 we immediately find that if 


Qq (13) = 22, (13) 


for some time f3 and a(t3) 4 0, then a(t3) = 0. 


Theorem 7.6 The Friedmann equations (7.21) and (7.13) imply that the pressure 
p= p(t)=Oforallt >t. 


Proof By differentiation of (7.21) we find that 


C 
2ad = 2Aaa — —a. 
a 


If k = 1 then by Theorem 7.3 there exists at most one point f € [T, oo) such that 
a(t2) = 0. If a 4 0, then we get 


i.e., this equation is valid except for at most one point f2. However, (7.13) can be 
rewritten as 


implying that p = 0, since p = p(t) is supposed to be continuous (including f2). 
oO 


Note that the zero pressure corresponds to incoherent dust (cf. (6.21)). This fits 
to astronomical observations for tf > T. 


Remark 7.25 If A = 0 and k = 1, then from (7.21) we get d?a” = c*(aa — a’), 
where a = C/c? is a positive constant. Using the conformal time 7 defined by 
(7.17), we find that 4 = £a’, where a’ = da/dn. Therefore, (a’)? = aa — a’. 


. Seah _ 2a . 
Employing the substitution x = = — 1, we obtain 


/ da -|- da =f dx _ in(—" ') 
vaa-a Jo {yp Aae Ve 
a 


Hence, 7 — 9 = arcsin( 24 = ) and sin(7 — no) = a — 1, which leads to — sin np = 


—1 and no = 2/2. Thus, a(n) = ae —cos7) for 7 € [0, 27] which leads to an 
oscillating universe, see also [348, p. 412]. 


Remark 7.26 Consider again the situation sketched in Fig. 6.2. Gravity (e.g. of our 
Earth or Galaxy) does not act on bodies which are in the observable universe, but it 
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acts on these bodies in their future cones (see also Remark 11.12 below). This fact 
is not described by the Friedmann equation (7.21). Hence, there is another problem 
of the standard cosmological model. 


Remark 7.27 Concerning the baryonic mass density Qp = 0.049 (see Fig. 6.9), we 
find by (6.29) and (6.31) that 


32pH} — 3- 0.049. (2.27)? - 10-36 4 5 3 
es = k =45-10-%k 
OO) RG 8x 6674-10-77 8/™ — 


for the present age of the universe fo. This is less than one proton in three cubic 
meters, since mp = 1.67 - 10-2? kg. However, it is usually said that there are 6 


protons per m? on averge. 


7.8 The Age of the Universe 


Expressing the expansion function a = a(t) as a Taylor expansion in time f = 0, 
which corresponds to the present time, we have by (6.27) and (6.29) 


a(t) = a(0) + a(O)t + str? +--+ =a(0) (1 + Hot — sa0tige +.. ) 
(7.34) 


1 c? > 
= a(0) (1-4 How 30350)! +). 
Here Hp = H(0) and go = g(0) = —0.6 is the usually accepted present value of the 
deceleration parameter (see [300, p. 110]) which is negative, since the expansion of 
the universe accelerates. This means that the expansion function a = a(t) is strictly 
convex on a given time interval, see Fig. 7.6. 

M. Carrera and D. Giulini [58, p. 175] correctly derive that at the distance of 
Pluto (i.e. about 40 au) the acceleration of the expansion of the universe is only 
2 - 10-73 m/s* which is indeed an entirely negligible quantity. A similar tiny value 
was derived by B. Mashhoon et al. [236, p.5041]. However, all these authors 
concentrated only on the single quadratic term in expansion (7.34) and did not 
consider the large value of the Hubble constant (6.31) which stands at the linear 
term in (7.34). In other words, an accelerated expansion does not manifest itself on 
scales of the Solar system, but the expansion itself is observable. In particular, we 
have 


C2 


3224 (0) 


1 29 1 
|Hot| > 3 |0l Hot = 3 |40! 
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Fig. 7.6 The assumed behavior of the normalized expansion function a(t)/a(0). The time variable 
is shifted for simplicity such that fg = 0 corresponds to the present time. The lower blue 
graph corresponds to the linear function | + Hot from (7.34) on the interval [—1/Ho, 0], where 
1/Hpj = 13.6 Gyr is the Hubble time. The upper green graph shows the quadratic function 
1+ Hot — 3q0He 1" with gg = —0.6. The middle red graph illustrates the behavior of the 
normalized expansion function calculated numerically fork = 1, Qy = 0.317, Qa = 0.683, 
and Hp = 67.15 km/(s Mpc), see [284]. We observe that the accelerated expansion differs only 
very little from the linear expansion during the last few Gyr 


for t close to 0. Consequently, the accelerated expansion given by the quadratic term 
only appears at cosmological distances. In spite of that, the single quadratic term is 
so small that the linear term | Hot| from (7.34) essentially dominates not only in the 
neighborhood of 0, but for all ¢ in the whole interval (—1/Hp, 0), since 


1 2.2 
0.3 -|Hot| > 5 |90l Hot ; 


where 5140! = 0.3. Recall that the physical dimension of the constant A is m7’. 


On the other hand, the physical dimension of Ho is s~! which does not depend on 
space variables. Therefore, we can detect the Hubble expansion also on small scales, 
where the effect of A is negligible. 

We cannot reliably estimate the remainder of the Taylor series on the whole 
domain of definition, since the first derivatives of the expansion function a = a(t) 
were extremely large just after the Big Bang. To see this, denote by 


to = 13.82 Gyr 


the estimated age of the universe by the standard cosmological model, see [290]. 
The microwave background radiation appeared tj = 380,000 years after the Big 
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Bang, see (6.33). Then for the measured cosmological redshift z = 1089 we find 
that 


tya(to) ty 380,000 - 1090 1 
—— —(z + 1) — 9 — > 
toa(t)) to 13.82 - 10 33.3 
where the first equality follows directly from (see [249, p. 730], [272, p.96]) the 


definition of redshift. From this we get 


a(to) _ ath) 
to ~ ty , 


33.3 - 


and thus the mean expansion speed of the universe on the interval (0, t,) was 33.3 
times larger than on the interval (0, to). This indicates that the expansion function 
had a much larger derivative after the Big Bang than today and that it was strictly 
concave on some subinterval (cf. Figs. 7.6 and 6.2). 


Remark 7.28 The oldest known star is HD 140283. Its age is estimated to 14.46 + 
0.8 billion years independently of cosmological models. Since it is in our close 
neighborhood (only 190 ly away from us), it is very probable that there are even 
older stars in the entire universe. This age would possibly conflict with the calculated 
age 13.82 billion years of the universe as determined from the Friedmann equation 
by the Planck satellite data. 


7.9 Flight Around the Universe 


Consider a homogeneous and isotropic universe which can be modeled by three 
maximally symmetric manifolds S*, E*, and HI up to scaling (see Theorem 6.2). 
Let us ask a purely theoretical question: 


Could we orbit the universe in finite time? 


If the curvature index k < 0, then the answer is NO, since the space is unbounded. 
So let k = 1. In the next example we show that for Einstein’s static universe the 
answer is obviously YES. 


Example 7.7 Assume that the universe is modeled by the three-dimensional sphere 
5? with fixed radius a = A~!/?, see (7.32). Further, assume that a space shuttle 
travels along a given great circle (geodesic) with constant velocity v = 0.999 c. 
Then its total cosmic travel time T can be estimated by the relation 1 yr + z - 10’ s 
as follows 


2a _ 2n - 10° 2x - 1076 


T= ~ R = 
' 3.108 °  3-108n-107> 


6.666 - 10!° yr (7.35) 


(which holds also for the speed of light). This is five times the present age of the 
universe fg = 13.82 Gyr according to the standard cosmological model. Note that 
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a(T) 


) 


Fig. 7.7 Trajectory of a space shuttle in an expanding universe modeled by the sphere Sy for 
a(T) = 2a(0). Its equation in polar coordinates reads r(v}) = a(0) exp(0 V/v) and its length is 
(a(T) — a(O))/ cos arctan(v/V) 


A (if it exists) is known only very roughly, see (7.32). However, the proper time of 
the traveler would be much smaller than 7 due to time dilation (2.5). 


Further, we shall consider a variable radius a = a(t) of the sphere S3 and for 
simplicity, we assume that it increases linearly with constant velocity V > 0 (cf. 
Fig. 7.6). The expansion velocity V can be even faster than light and there is no 
contradiction with STR outside the observer’s inertial system, see [79]. Moreover, 
assume that the space shuttle has also a constant velocity v > 0 with respect to 
its local neighborhood (i.e. intergalactic dust). In this particular case, its trajectory 
will be one turn of the logarithmic spiral instead of the great circle, see Fig. 7.7. 
Recall that the logarithmic spiral is a self-similar curve making a constant angle 
0 € (0, 2/2] with the polar radii at all its points [296, p. 139], (for an approximately 
logarithmic double spiral pattern see also Fig. 8.7). 

The following surprising theorem was inspired by Lubo§ Pick and Mirko Rokyta 
(see [283, p. 228]). It states that if the space shuttle has an arbitrarily small constant 
velocity v > 0 with respect to V, then it always needs finite time to travel around a 
linearly expanding universe. Therefore, we will deal with the nonrelativistic addition 
of velocities in Theorem 7.7 below. The spiraling trajectory can be easily unfolded 
into a straight line. Thus, in fact, we shall investigate only a one-dimensional 
problem. For brevity, we also make a linear shift of cosmic time so that tf = 0 
corresponds to the start time. 


Theorem 7.7 (Pick—Rokyta) Let S3 (1) expand with constant velocity V > 0, ie. 
a(t) = V for all t > 0, and let v > 0 be a constant velocity of the space shuttle. 
Then its total cosmic travel time around the universe is 


exp(27 V/v) — 1 


T =a(0) V 


(7.36) 
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Proof Assume that the space shuttle will launch from the axis x at time tf = 0. The 
circumference of a great circle (geodesic) of S3 at time ¢ > 0 is clearly given by 


2ma(t) = 2m(ao+ Vt), (7.37) 


where ap = a(0) is the initial radius. Let b(t) be the instant distance of the space 
shuttle at time t > 0 along the great circle with radius a(t) from the axis x, i.e. from 
the launch point in an expanding universe (see Fig. 7.7). It satisfies the following 
initial problem 


b(t) =v+w(t), b(0)=0, (7.38) 


where w = w(t) is the drifting velocity of a linearly expanding space. We see 
that the ratio between the velocity w(t) and the expansion velocity 27 V of the 
great circle is the same as the ratio between the distance b(t) and the circumference 
27 a(t), namely, 


wit) b(t) 
IV 2malt)’ 


From this, (7.37), and (7.38) we get the initial problem 


b(t) =v+V 


oO. gad. SOV SO: (7.39) 
Vt 


40 


The right-hand side of this first order differential equation contains the velocity 
v which is increased by the drifting velocity w(t) of space expansion. By the 
substitution tf = O and differentiation, we can easily verify that 


ag+Vt 


ao 


bi) = aa v)— lis (7.40) 


is the solution to problem (7.39). The space shuttle will return to the launch point 
from the opposite direction at time T when 


2na(T) = b(T). 
From this, (7.37), and (7.40) we obtain the equation 


v. agtVT 
— In ——— = 


2m. 7 Al 
Vv - (7.41) 
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Consequently, 


2nV 


Uv 


ag + VT = ao exp 


(7.42) 


and thus (7.36) follows. oO 


Remark 7.29 The assumption that V > 0 is constant in Theorem 7.7 is not too 
restrictive, since the expansion function a = a(t) in Fig. 7.6 is almost linear during 
the last 10 Gyr. Moreover, for the linear function a(t) = ag + Vt the associated 
Hubble parameter 


PC es 
7G) eee 


has a similar behavior as that in Fig. 10.4. By l’ Hospital’s rule, (7.36) converges for 
V > Oto T = 2za(0)/v corresponding to Einstein’s static universe (7.35). 

The assumption that v > O is constant in Theorem 7.7 is also not restrictive, 
since for each variable velocity of the space shuttle greater than v we clearly get a 
smaller (i.e. finite) travel time. 


Remark 7.30 For simplicity, we have chosen a(T) = 2a(0) in Fig.7.7. From 
this and (7.41) it follows that v/V = 27/1n2. Hence, the constant slope angle 
0 between the polar radii and the tangent line to the spiral is only slightly less 
than 2/2, where 0 = arctan(v/V). Nevertheless, Theorem 7.7 covers also the case 
0 <u <c < V which is more realistic. 


Now we show that the CMB radiation (the cosmological horizon) might be just 
the image of the antipodal point of our neighborhood ~ 14 Gyr ago. 


Example 7.8 To illustrate the main idea of the above hypothesis, consider the 
trajectory of a photon which is moving at the velocity v = c with respect to its 
local space. Then for the trajectory from the antipodal point of 30 to our present 
location we find by (7.37) and similarly to (7.42) that 


mV 
a(t) = a(O) + Vt = a(0) exp —, 
c 


where f is the travel time. From this and (7.1) the cosmological redshift of the photon 
is given by 
a(t) 
Z= — —1=exp(2.227) —1 1075 for V = 2.22c. 
a(0) 
This value is quite close to the observed redshift z = 1089 of the CMB (see Fig. 6.1), 


even though the actual expansion velocity of the universe is variable and not a fixed 
constant V. 
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7.10 Conclusions 


The current ACDM cosmological model supposes that time flows completely 
uniformly from the Big Bang on. By [361, 362] the proper time should be replaced 
by the conformal time (7.17). The main objections against the Friedmann equation 
can be summarized as follows: 


1. It implies the existence of dark matter and dark energy that have not been 
confirmed by measurements, yet, see Sect. 6.7. 

2. It was derived by excessive extrapolations from Einstein’s equations applied to 
the entire universe, see Sect. 7.2. For instance, the Hubble-Lemaitre constant 
was calculated by extrapolating 13.82 Gyr old CMB data to the present time, 
see Remark 6.23. This is a typical ill-conditioned problem. 

3. It is too simple model described by the first order ordinary differential equation 
with constant coefficients, see Sect. 7.5. 

4. It yields a smaller age of the universe than the oldest stars in our neighborhood, 
see Remark 7.28. 

5. It is fully deterministic and reversible, see Remarks 5.6 and 7.3. 

6. It is used to calculate the time and space distances, see Sects. 7.3 and 7.4. The 
difference in the denominator of (7.9) can produce numerical instabilities, see 
Remarks 7.4 and 7.15. 

7. The cosmological parameters allow division by zero, see Sect. 6.7. 

. The cosmological parameters have a strange behavior in time, see Remark 6.21. 

9. No significant digit of the cosmological constant is known, see Remark 3.15. 
Let us emphasize that this constant stands at the leading and most dominant 
quadratic term in (7.21). 

10. The emergence and evolution of a homogeneous and isotropic hyperbolic (or 

Euclidean) universe cannot be well described by the Friedmann equation, since 
this would require an infinite speed of information transfer, see Remark 6.27. 


oo 


Chapter 8 Mm) 
Arguments Against the Proclaimed cree | 
Amount of Dark Matter 


8.1 Problem of Missing Matter 


The term dark matter was first briefly mentioned in Jan Oort’s paper [266] for 
mass that is not visible. One year later this term (in German dunkle Materie) was 
independently used by Fritz Zwicky in his 1933 paper [385] on page 125. This 
paper essentially changed the development of astronomy and cosmology for many 
decades. Using the Virial Theorem, Zwicky found that in order to explain rapid 
movements of only eight galaxies in the giant galaxy cluster Abell 1656 in Coma 
Berenices (called Coma cluster), he had to assume the existence of a 400 times 
larger amount of nonluminous matter than luminous to keep the cluster together. 
In 1936, Sinclair Smith [326, p. 27] independently reached a similar conclusion for 
the Virgo cluster. The huge factor 400 was reduced to 150 in another paper [386] 
by Zwicky. None of the above-mentioned authors claimed that dark matter must 
be nonbaryonic other than just being optically invisible. But in modern terms dark 
matter stands for hypothetical nonbaryonic dark matter of unknown composition, 
i.e., it does not include baryonic matter that is not luminous (i.e. also black holes). 
Moreover, it does not interact electromagnetically with “normal” matter. At present, 
it is supposed that 


the ratio of masses of dark matter to baryonic matter ~ 6:1. (8.1) 


The aim of this chapter is to collect the main arguments from our papers [165, 
183, 185] showing that this ratio is exaggerated. We do not claim that dark matter 
does not exist. However, we do claim that the ratio on the left-hand side of (8.1) 
is smaller and it can even be zero. We present several facts and phenomena which 
should be taken into account to estimate the amount of dark matter. 

In Sect. 8.2, we briefly recall the Virial Theorem which establishes the equilib- 
rium of the total kinetic and potential energy of gravitationally interacting bodies 
of a stable system. Then we present Zwicky’s method and comment on it. We also 
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point out several effects that were not taken into account by Zwicky and that may 
essentially reduce the calculated mass of the whole Coma cluster. 

In Sect. 8.3, we present two “back of the envelope” calculations illustrating 
whether it is necessary to assume extragalactic dark matter in the Coma cluster. 
In Sect. 8.4, we analyze rotational curves proposed by Vera Rubin and in the last 
section we give some concluding remarks on the existence of dark matter. 


8.2 Analysis of Zwicky’s Method 


Zwicky in [385, 386] investigated large velocities of galaxies in the Coma galaxy 
cluster which is located near the north Galactic pole. Therefore, the recession speed 
of the Coma cluster from the Sun is practically equal to the recession speed of the 
Coma cluster from the Milky Way, even though the orbital speed vg ~ 230 km/s of 
the Sun about the Galactic center is relatively high. Using the Virial Theorem 


L277 SU, (8.2) 


Zwicky estimated its mass M by the following simple formula 


M= oy (8.3) 
3G’ 
where V = — 2 GM? /R is the total potential energy of the cluster, 7 = 5M VU’ is the 


kinetic energy, R is the radius of the cluster, and v is the mean quadratic speed with 
respect to the center of gravity of the cluster. For detailed derivations of the Virial 
Theorem and formula (8.3) we refer to [183]. Note that Smith [326, p. 29] presents 
a similar formula to (8.3), where the coefficient 3 is replaced by 5. This is a quite 
big difference. 

To explain that 


3 > 1000 km/s 


in the Coma cluster, Zwicky had to assume the existence of a large amount of 
nonluminous matter to keep the cluster gravitationally bound together. However, he 
used very imprecise data and made many simplifying assumptions. For instance: 


1. Zwicky assumed that the distance to the Coma cluster is 14 Mpc which was 
measured by Milton Humason. However, the actual distance is about 100 Mpc. 
Therefore, also Hubble in [134] got the following large value of the Hubble 
constant Hyp = 500 km s~!Mpc7!, see Remark 6.17. 

2. Zwicky was mistaken by two orders when calculating masses of galaxies in the 
Coma cluster from their luminosity. 


8.2 


10. 


of 
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. He used Newtonian mechanics with an infinite speed of gravitational interac- 


tion, while the actual speed is finite. 

He replaced galaxies of diameter ~10!° au by point masses. However, points 
cannot rotate. Hence, this does not allow one to consider angular moments of 
rotating galaxies which also contribute to the total angular momentum. 


. Tidal forces that influence the dynamics of the system were not considered. For 


instance, an isolated binary system of two galaxies that orbit closely to each 
other is not stable, since galaxies will merge due to tidal friction, whereas the 
classical two-body problem has a periodic solution. 

Zwicky substituted the spacetime curved by a thousand of galaxies by 
Euclidean space. 


. The gravitational redshift was ignored. Photons have to overcome the potential 


well of each star, the potential well of the corresponding galaxy, and also the 
potential well of the whole Coma cluster. 


. Zwicky assumed that the Virial Theorem (8.2) holds absolutely exactly and that 


the Coma cluster is in equilibrium. 

He supposed a uniform distribution of galaxies inside the Coma cluster, see 
[386, p. 229]. However, the central region is much denser than the region around 
the boundary and larger galaxies are closer to the center. 

In Zwicky’s papers [385, 386], an isotropic distribution of velocities was 
assumed and relativistic effects of high velocities were ignored. 


There are many other nonnegligible phenomena and facts (e.g., the existence 
nonluminous intergalactic baryonic matter, local Hubble expansion, thousands 


of dwarf galaxies of the Coma cluster that were unknown to Zwicky, gravitational 
lensing, see Fig. 8.1) which were not accounted for in Zwicky’s method, see [183] 
for details. A large amount of errors of various origins may essentially distort the 
resulting calculation of the mass. Thus we cannot make any reliable estimates of the 
actual mass of the Coma cluster by means of (8.3). The discrepancy between the 
virial mass and luminosity mass can be essentially reduced by means of actual data 
(see Fig. 8.2). 


Fig. 8.1 A schematic illustration of the selflensing effect. The observation angle 8 = <ABC is 
slightly larger than the angle a = <ABC due to the bending of light caused by the gravitation of 
a galaxy cluster itself, see [184, p. 117] 
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Fig. 8.2 Histogram of radial velocities less than 25,000 km/s of galaxies which are projected to the 
Coma cluster region. Here we consider only those galaxies whose magnitudes do not exceed 20. 
These data are available in [10], [31], and [69]. Galaxies possessing blueshift are on the left. The 
dark line represents the Gaussian curve fitted to those data that correspond to galaxies belonging 
to the Coma cluster. The other galaxies, which are contained in the same region of the sky, are not 
contained in this cluster, since their velocities with respect to the center of gravity of the cluster 
are too large. They should not be used to calculate the virial mass (8.3), because they do not form 
a bound system 


8.3. Two Simple Examples 


Now we present two examples showing that (8.1) is exaggerated. They can be 
convincingly verified by simple hand calculations. 


Example 8.1 The two largest galaxies in the Coma cluster are supergiant elliptic 
galaxies NGC 4889 and NGC 4874 which have almost the same luminosity (see 
Fig. 8.3). For simplicity assume that they have the same mass m and that they orbit 
along a circular trajectory with center O, radius r, and velocity vu (see Fig. 8.4). 
If one of these two galaxies were to be smaller, it would orbit the larger one by a 
higher velocity and along a longer path. Then it would meet and then absorb more 
neighboring galaxies than the larger one. By this mechanism the masses of both 
supergiant galaxies are balanced. By [397] they are ten times more massive than our 
Galaxy, i.e. 


m = 10Mg = 10° Mo =2- 10" kg, (8.4) 
where the total mass of our Galaxy Mg = 10!* Mo is given in [214, p. 127]. 


Since the gravitational potential inside a homogeneous spherical layer is con- 
stant, external galaxies and possible dark matter outside the sphere with center 
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Fig. 8.3 Giant galaxy cluster Abell 1656 in the constellation Coma Berenices. In the middle there 
are two supergiant elliptic galaxies NGC 4889 and NGC 4874 (photo NASA) 


‘@) m 


me 


Fig. 8.4 An idealized circular trajectory of the two largest galaxies in the Coma cluster. The 
recession velocity of the center O from us is v 


O and radius r have almost no influence on this motion, see Theorem . From 
Newton’s laws we get 


Gm? mv- 
4p2 = >" (8.5) 


The distance of both galaxies on the celestial sphere is 8.15’ which in projection on 
the distance 100 Mpc gives 7.32 - 107! m. Thus for the radius r we have 


r > 3.66- 107! m. (8.6) 
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According to [10, p. 19], the measured radial velocities of both supergiant galaxies 
are 6472 km/s and 7189 km/s, respectively. Their average velocity 


v = 6830.5 km/s 
nicely corresponds to the mean recession speed of the whole cluster 6877 km/s, see 


[183, p.53]. For the radial velocity v;adia) with respect to v we get by (8.4), (8.5), 
and (8.6) 


7,189,000 — 6,472,000 G 
3.585 - 10° = 5 =a eS Va 


m/s = 3.02- 10° m/s. (8.7) 


a 6.674 - 10-1! .2- 103 
- 4 - 3.66 - 10?! 


Comparing the left-hand and the right-hand sides, we find a small discrepancy. 
This simplified example shows that Newtonian mechanics does not describe reality 
correctly or the masses or radial velocities of both the elliptic galaxies are wrongly 
estimated or we have to assume the existence of invisible matter between galaxies. 


Remark 8.1 Slightly different velocities 6505 and 7108km/s of the two giant 
galaxies as given in [31] would even yield a smaller value 3.015 - 10° m/s on the 
left-hand side of (8.7). Moreover, considering the gravitational influence of other 
matter that is inside the sphere with center O and of radius r, the right-hand side of 
(8.7) would even be larger, i.e., no discrepancy would appear. 


Remark 8.2. By Tutukov and Fedorova [353] the intracluster medium contains 30- 
50% of stars from all the stars of the cluster. According to [14, 35, 366], clusters of 
galaxies contain five times more baryonic matter in the form of hot gas producing 
X-rays than baryonic matter in galaxies themselves. Also [35] and [135] claim that 
the intracluster medium contains more baryonic matter than the matter of all stars 
in a given galaxy cluster. Moreover, the lower bound in (8.6) is slightly smaller, 
since the cluster itself magnifies the angular distances due to lensing effects (see 
e.g. Fig. 8.1). Hence, the right-hand side of (8.7) should be larger. Therefore, it does 
not seem that there would be six times more dark matter than baryonic matter as 
proposed in (8.1). 


Example 8.2. Assume, for simplicity, that the additional baryonic matter has a 
spherically symmetric distribution and denote its mass by M. Concentrating the 
mass M to the center O, the speed v in (8.7) can be, using Newton’s First 


Theorem 1.1, replaced by 
= [/|G(m+ 4M) 
v = ,/ ——_—— > 0 
4r 
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To see this, it is enough to consider the relation 


Gm2 no 4 GmM - mv 
4r2 4 ror 


instead of (8.7). Zwicky’s paradox of large observed velocities thus disappears, since 
it has a quite natural explanation without dark matter. 


8.4 Analysis of Rotation Curves by Vera Rubin 


It is said that Vera Rubin’s greatest discovery was the fact that spiral galaxies 
have “flat” rotation curves [310]. On that basis, in the ’70s of the last century 
she developed her own theory of rotation curves of galaxies. From the high orbital 
speed of stars she concluded that galaxies should contain much more nonluminous 
than luminous matter to be kept together by gravity—see e.g. her review articles 
(307, 309] on dark matter. 

In nearby spiral galaxies Rubin found, see [309] and also [311, p. 480], that all 
stars of these galaxies move at almost the same constant speed of order v ~ 200 
km/s for r > ro, where 79 > O approximately corresponds to the radius of the 
central bulge and is typically equal to a few kpc (see Fig. 8.5). On the other hand, 
she observed that for r < ro the inside of the spiral galaxy (including a possible 
bar) rotates with roughly constant angular speed in a manner like that of a DVD 
record, i.e., the speeds of these stars are approximately linearly proportional to their 
distance from the center (see Fig. 8.5). An exception is a very close neighborhood 
of the central black hole, see Remark 8.3 below. 

In this section we shall look more closely at Rubin’s hypothesis. Consider a test 
particle of mass m (typically this will be a star) and let M >> m be the mass of some 
body generating the central force field. Assume that the test particle revolves about 
the center along a circular orbit with radius r and speed v. Then from Newton’s law 


0 4 20 ‘Fr (kpc) 
Fig. 8.5 The solid line shows an idealized rotation curve whose shape was derived by Rubin by 
means of measurements. It is flat for r > rg © 4 kpc. The dashed line shows the rate of decrease 
of velocities for Keplerian orbits that depend on the distance r from the center of a spiral galaxy 
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of gravitation and the relation for centripetal force Rubin easily obtained that, see 
[307], 


Mm mv? 
C= (8.8) 
r r 
that is 
GM 
v=,/—. (8.9) 
r 
The velocity v of a particle on a circular orbit is thus proportional to r~!/?, see 


Fig. 8.5. Such orbits are called Keplerian. 
In [310, p. 491] Rubin states that 


the stellar curve does not decrease as is expected for Keplerian orbits. 


Remark 8.3 To explain this discrepancy, it is important to realize that spiral galaxies 
do not have a central force field except within a close neighborhood of the center, 
where e.g. in the Milky Way the stars S1, $2, ... orbit a central black hole according 
to Kepler’s laws with velocities up to 7000 km/s. The mass of this black hole is 
roughly 4 million solar masses, which is less than 0.01% of the total mass of our 
Galaxy Mc © 10!7Mo = 2- 10* kg, see (8.4). 

In the Solar system, on the contrary, 99.85% of the mass is concentrated at 
the Sun. The planets barely interact gravitationally among themselves and their 
movements are determined mainly by the central force of the Sun. On the other 
hand, trajectories of stars in a galactic disk are substantially influenced mainly by 
neighboring stars, because the central bulge contains only about 10% of all mass of 
a galaxy. 


Remark 8.4 It is noteworthy to mention the following big mathematical paradox 
of Rubin’s observations. The stars of spiral galaxies are measured to move at 
an almost constant speed on different radii (see e.g. [309, p.7]) which causes a 
differential rotation. However, these galaxies are not winding up and surprisingly do 
not show an expected tightening of arms as shown in Fig. 8.6. The idealized constant 
orbital velocity in Fig. 8.5 for r € [4, 20] kpc produces special spirals of the form 
(r cos(a + b/r), r sin(a + b/r)) for some suitable constants a and b. Therefore, 
their shape cannot be stable if they have gone through so many revolutions. This 
is known as the winding problem or wind-up problem [165, 362]. To see this we 
present the following example. 


Example 8.3 Consider a spiral galaxy as marked in the upper left part of Fig. 8.6. 
Assume that the outer radius is 20 kpc and the inner radius rp = 4 kpc (see Fig. 8.5). 
Then after one revolution of a star on the outer orbit a star on the inner orbit makes 
20 : 4 = 5 revolutions if it has the same speed. This contradicts observations (see 
e.g. the galaxy M51 in Fig. 8.7). So there is has to be some systematic error. 
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Fig. 8.6 The flat rotation curve from Fig. 8.5 causes a very quick tightening of spiral arms after 
one revolution of the most external stars. This takes approximately 400 Myr for a typical spiral 
galaxy (note that the age of the Sun is more than 10 times larger). Here stars on the galactic 
edge revolved about the angles: 0 (an idealized initial state), 7/4, 7/2, m, 32/2, and 27. The 
corresponding time in given in Myr. After a relatively short time period of 100 Myr we would 
get a shape similar to Fig. 8.7. After a few hundred million years the spiral arms would be highly 
twisted. After 1 Gyr they would be completely wrapped up. However, observed spiral galaxies 
exist for many Gyr 


With difficulty it can also be assumed that the arms of e.g. type Sbe galaxies are 
formed by some kind of density waves as suggested in [29, p.544]. Such barred 
spiral galaxies resemble an open letter S. Therefore, some astronomers did not 
become convinced of the need for dark matter halos in spiral galaxies, see e.g. 
[206, 238, 246, 316]. 

Now we shall investigate orbital speeds of stars in our Galaxy to show that no 
dark matter is necessary. 


Example 8.4 Our Sun orbits the center of the Milky Way with the speed 
Veo = 230 km/s (8.10) 


(most sources give the range of 220-240 km/s) on a path of radius rg = 8 kpc. The 
radius of the visible part of the disk of our Galaxy is estimated by 


rg = 16 kpc = 4.938 - 10° m, (8.11) 
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Fig. 8.7 Spiral galaxies could not have such a large symmetry if there were to be six times more 
uniformly distributed dark matter than structured baryonic matter (photo NASA) 


i.e., the Sun stays about halfway out from the center O of the Galaxy. Stars orbiting 
the center of our Galaxy at any distance r > ro © 4 kpc should have a speed similar 
to Ue due to the expected flat rotation curve (see Fig. 8.5). 

Denote the mass of all baryonic matter contained inside the ball of radius (8.11) 
centered at O by M(rcg). By MikulaSek and Krti¢éka [245, p.353], we have the 
following guaranteed lower bound 


M(rg) = 3.85- 104! kg. (8.12) 


We shall proceed in two steps: 


1. First, let us concentrate all baryonic matter contained inside this ball to the central 
point O. Then from relations (8.9), (8.11), and (8.12) the velocity of a test particle 
on the orbit of radius rg is 


674. 10711. 3.85 104! 
pa) eG) ge SOE fig ne cee 
- 4.938 - 1020 


(8.13) 


8.4 Analysis of Rotation Curves by Vera Rubin 179 


Fig. 8.8 A ball with 
symmetrically distributed 
mass with respect to the 


horizontal plane acts on a test 
particle by a smaller force 
than the mass projected 1 \ Ss m 


perpendicularly to the Ba 
horizontal plane of the disk — 
dashed 


By this simple “back of the envelope” calculation, we see that v is indeed 
comparable to the measured speed (8.10). Although these relations are only 
approximate, to postulate the existence of six times more dark matter than 
baryonic matter (see (8.1)) to hold the Galaxy together by gravity seems to be 
highly overestimated. 

2. Second, we claim that the orbital velocity around a flat disk of the same mass is 
even higher than that in (8.13). This results from the following theorem proved 
in [184, pp. 131-134]. 


Theorem 8.1 A particle orbiting a central point mass along a circular trajectory 
of radius R has a smaller speed than if it were to orbit a flat disk of radius R and 
the same mass with an arbitrary rotationally symmetric density distribution. 


To explain the main idea of the proof, consider the situation in Fig. 8.8. Let two 
point masses m; = mz be located inside a ball placed symmetrically with respect 
to the horizontal plane. Let a test particle with mass m be in this plane. Then the 
total force F of both point masses acting on the test particle of mass m will be 
less than the force F of both point masses projected perpendicularly to the disk and 
acting on m. Let d be the distance between m, and m. Denoting by b its orthogonal 
projection on the horizontal plane, we find that 


2 = 2. 
celia and PoaG- 


F=G 
| b2 


Thus we see that the ratio of forces F and F is equal to the third power of the 
fraction d/b: 


This cubic nonlinearity causes a greater attractive gravitational force by the disk 
than by the ball (cf. (8.8)), and thus also a higher orbital speed around the disk. For 
instance Gy = Gy = 2. The above Theorem 8.1 thus explains the large orbital 
velocities of stars in our Galaxy, even though spiral galaxies do not have rotationally 
symmetric distribution of mass. It also indicates why (8.1) is overestimated. 
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Remark 8.5 Rubin in [306, p. 1340] and [308, p. 29] assumed a spherical symmetry 
of the whole galaxy and that the density M(r)/r? decreases as r~*. Then from the 
relation GM(r)m/r? = mv(r)/r (cf. (8.8)) she got v(r) = const. This is indeed 
only a very rough calculation. 


Remark 8.6 Orbital speed v satisfies the well-known baryonic Tully-Fisher law 
v* ~ M, where M is the mass of a spiral galaxy, see [351, 380] and also [178, 
p. 75]. 


In [184, p. 131], we prove why the force of a disk-shaped galaxy acting on a test 
particle is much larger than it would be if its whole mass were to be concentrated at 
one central point (cf. Newton’s First Theorem 1.1). Therefore, the speed v of stars 
on circular orbits in a spiral galaxy should be higher than for Keplerian orbits (see 
Fig. 8.5). 


Remark 8.7 The average thickness of the disk (except for the bulge) of spiral 
galaxies varies from 300 pc to 1 kpc. It is therefore about 30-100 times thinner 
than the diameter of the visible part of the galaxy. This is easily seen when galaxies 
are observed edge-on. Moreover, the gas and dust are mainly found close to the 
plane of symmetry of the disk [291]. Consequently, in [185] we treat the disk just as 
a two-dimensional body, which is obviously a better approximation than a central 
point mass. The gravitational field of spiral galaxies will therefore be approximated 
by the gravitational field of a flat disk with rotationally symmetric mass density 
distribution. 


8.5 Final Remarks 


In previous sections we introduced a number of arguments showing that the 
amount of dark matter as given in (8.1) seems to be considerably overesti- 
mated. Consequently, the amount of dark matter proposed by the standard ACDM 
(i.e. Lambda-Cold Dark Matter) cosmological model is mistakenly determined, or 
that dark matter does not exist at all. This model based on the Friedmann ordinary 
differential equation (7.21) of the first order is too simple and thus cannot yield 
correct conclusions. 
Let us briefly mention several further arguments: 


1. The influence of dark matter in the Solar system has not been observed (see 
[251]), even though our Sun is a large gravitational attractor. 

2. At the end of the twentieth century it was thought that red dwarfs of spectral class 
M form only 3% of the total number of stars, see e.g. [29, p. 93]. Nevertheless, 
according to the current Harvard spectral classification of stars, red dwarfs are in 
the vast majority—over 76% (cf. Table 8.1). The main reason is that they stay on 
the main sequence a very long time and they are not visible due to the selection 
effect. However, they essentially contribute to nonvisible baryonic matter at large 
distances. The same is true for the other spectral classes L (red-brown dwarfs), T 
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Table 8.1 The Harvard spectral classification of stars by [399]. The first column shows the main 
spectral classes of stars, T is their surface temperature, and MM is their mass. The last two columns 
contain their maximum age on the main sequence of the Hertzsprung-Russell diagram and the 
corresponding fraction of all main-sequence stars 


Class Color T (K) M/Mo Age (Myr) Fraction 
O Blue >30,000 >16 1-10 0.00003% 
B Blue white 10,000-—30,000 2.1-16 11-400 0.13% 

A White 7500-—10,000 1.42.1 400-3000 0.6% 

F Yellow white 6000-7500 1.04-1.4 3000-7000 3% 

G Yellow (Sun) 5200-6000 0.8-1.04 7000-15,000 7.6% 

K Orange 3700-5200 0.45-0.8 ~20,000 12.1% 

M Red 2400-3700 0.08-0.45 ~50,000 76.45% 


(brown dwarfs), and Y (black dwarfs). However, these dwarfs are not considered 
as ordinary stars. 

3. By McLaughlin and van der Marel [239] the ratio between virial mass and 
luminous mass in globular clusters is less than two. 

4. Dark matter should only interact gravitationally (and perhaps also weakly). 
Particle physics experiments looking for dark matter (i.e. hypothetical particles 
such as axions, neutralina, wimps, etc.) or dark antimatter found no evidence of 
their existence. 

5. Most of the galaxies have a spiral structure. If these galaxies were to contain 
six times more uniformly distributed dark matter than baryonic matter, then they 
could not exhibit such a high symmetry of structured baryonic matter as seen 
e.g. in Fig. 8.7, since they would be governed by a dark matter halo. 


Remark 8.8 Itis very probable that Newton’s law of gravity and the General Theory 
of Relativity on large cosmological scales approximate reality only very roughly 
and thus the proclaimed dark matter is nothing else than a modeling error. Several 
modifications of Newtonian theory, e.g. MOND (Modified Newtonian Dynamics) 
[238, 246, 316] and its relativistic generalization Te VeS (Tensor-Vector-Scalar) [27], 
are at present being developed and studied. Effects that are attributed to dark matter 
are explained by a different form of the gravitational law on large scales. However, 
MOND contradicts the principle of causality (a consequence cannot precede its 
cause), since it assumes an infinite speed of gravitational interaction, 1.e., it cannot 
be a good model on cosmological scales. In [157] and [184, p. 253], we included 
delays given by a finite speed of gravity into the system (1.14). According to 
Asher Yahalom [379, 381], any changes inside galaxies may affect other parts 
tens of millennia afterwards. Also the Tully-Fisher law can be explained by taking 
retardation effects into account, see [380]. It is a pity that these retardation effects 
are not included in the present day analysis of galaxy evolution. Finite speed of 
gravitational interaction contributes to circularization of elliptic orbits, since larger 
velocities produce larger gravitational aberration angles. 
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Remark 8.9 Dark matter cannot be hot (having relativistic velocities), since then we 
would not observe complicated structures of baryonic matter. 


Pavel Kroupa (see e.g. [205, 206]) presents a number of observations which rule- 
out the existence of dark matter particles. He shows that there are no dark matter 
halos, since then our neighboring dwarf galaxies would slow down due to dynamical 
friction with dark matter and they would spiral into the center of the Milky Way. This 
is not observed and too many dwarf galaxies are even flying away from the Local 
Group too fast. Moreover, several dwarf galaxies orbit our Galaxy and M31 in a few 
planes (i.e. they are isotropically correlated) which would be impossible with dark 
matter. The dynamical friction of dark matter halos would also slow down central 
bars of spiral galaxies, cf. also [304]. Too long tidal tails of several large galaxies 
are also not consistent with dark matter halos. 

At present there are many other papers claiming that it is not necessary to assume 
the existence of dark matter on scales of galactic disks, that the ratio (8.1) is highly 
overestimated or that dark matter does not exist at all, see e.g. [101, 111, 140, 165, 
183, 185, 251, 285, 400]. 

On the other hand, Douglas Clowe et al. in [67] assume a collision of two 
galaxy clusters, where the intergalactic gas is stopped, while the galaxies continue 
in an unchanged direction together with presumed dark matter. The title of this 
paper A direct empirical proof of the existence of dark matter should imply that 
dark matter was finally found. Nevertheless, we are not able to measure tangential 
components of the velocities of these clusters to prove that the collision really 
happened. The authors neglect dynamical friction of particular galaxies and suppose 
a quite unrealistic mutual infall velocity (see [67, p.L112]) 


v © 4700 km/s > 0.01c 


of these clusters to guarantee that the collision does not last more than several 
billion years. The expected tidal tails were not observed. Moreover, the proposed 
infall velocity v has the opposite sign to the overall Hubble expansion speed of 
the universe. How could these two galaxy clusters get such unlikely high velocities 
several Gyr ago? This would produce an extremely large kinetic energy proportional 
to uv” in an almost isotropic and homogeneous universe, where the local speed of 
galaxies is usually only several hundred km/s. Therefore, any conclusion about the 
direct proof of a large amount of dark matter is questionable. 

The regions with dark matter in [67] are artificially colored in blue by some 
numerical simulations based on gravitational lensing. In the case of light bending 
near our Sun during total eclipses, we know exactly the bending angle. However, 
in the case of hypothetical dark matter regions we have to apply only some inexact 
heuristic algorithms and interpolation techniques between galaxies, since galaxies 
are seen projected on the celestial sphere and they are represented only by several 
pixels in photos. 


Chapter 9 4.) 
Dark Energy and the Local Hubble on 
Expansion 


9.1 Local Hubble Expansion 


Most cosmologists believe that the universe expands globally, but not locally, see 
e.g. [249, p. 719]. However, this immediately leads to a mathematical contradiction. 
We will demonstrate its main idea in the following example. 


Example 9.1 Suppose that the universe expands globally and, for simplicity, let us 
sketch it as an ellipse illustrated in Fig. 9.1. Then it has to expand in its left or right 
half. If it expands in the right half, for example, then it must expand in the upper or 
lower quarter. Let us therefore assume that it expands e.g. in the lower quarter. Then 
we again halve it. In this way, we can proceed as shown in Fig. 9.1 and obtain that 
the universe must expand at least somewhere locally. 


The above example is inconsistent with Kepler’s laws, and therefore also with 
the law of conservation of energy. For instance, all planets of the Solar system are 
sufficiently isolated from the gravitational influence of nearby stars. By (1.9) the 
gravitational force between the Earth and Alpha Centauri (whose mass is 1.1Mo 
and distance 4.37 ly) is about | million times smaller than the maximal gravitational 
force between the Earth and Venus. To illustrate the sparseness of our Galaxy, let us 
mention that if we reduce the Sun with diameter 1.391016 - 10° m to the size of a 
ping-pong ball with a diameter of 3.7 cm, then Alpha Centauri would be 


0.037 - 4.37 - 9.46 - 10° 


m =1.01-10°m 
1.391016 - 10° 


from us, i.e. slightly more than 1000 km. This shows how sparse our Galaxy is in our 
neighborhood. The diameter of the Galaxy in this scale would be 100 times larger 
than the Earth-Moon distance. 

Some authors argue (see e.g. [58, 70]) that dark energy does not manifest itself 
at all in the Solar system. From Sect. 7.8 we know why their reasoning is erroneous. 
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Fig. 9.1 An illustration that the universe has to expand somewhere locally. It can be easily 
generalized to the three-dimensional space 


Example 9.2 Let us rescale the value Hp to the mean Sun-Earth distance, i.e. one 
astronomical unit 


1 au = 149,597,870,700m ~ 150-10? m. 


Taking into account that | pe* 206,265 au and that one sidereal year has about 
31,558,150, we get 


70 - 31,558,150 
Ho © 70kms~!Mpc~! = 70ms~!kpe~! = ——————— -lauwl, 
0 s pe ms kpc 306,265,000 myear au 


Therefore, 


Ho * 10myear'au~!. 


From this we observe that | m? of space will increase by an average of 0.2 mm? per 
year, since 


(1+ a yx fe a PT Pa 
150-1097 150-109 

Hence, each cubic meter of the universe has increased its volume approximately 

twice on average since the origin of the Solar system 4.6 - 10° year ago. 


Michael S. Turner [352] is well known for coining the term dark energy in 1998 
for the force that is responsible for the expansion of the universe. There is no reason 
to assume that manifestations of dark energy do somehow avoid our Galaxy or 
the Solar system. Since the above values in Example 9.2 are relatively large, the 
influence of dark energy should also be found in the Solar system. 


9.2 The Universe Expands Almost as Rapidly as the Moon Recedes From the... 185 


Admitting the manifestation of dark energy in the Solar system, we can easily 
explain a wide range of puzzles such as the paradox of tidal forces of the Moon, 
the paradox of the large orbital angular momentum of the Moon, Titan, Triton and 
Charon, the existence of rivers on Mars and the existence of its satellite Phobos, 
migration of planets, the faint young Sun paradox [213], the formation of Neptune 
and the Kuiper belt of comets, the slow rotation of Mercury and the absence of its 
moons. 

In the following sections we present a number of astrobiological, astronomical, 
geometric, geochronometric, geophysical, heliophysical, climatological, paleonto- 
logical, and observational arguments to show that the Solar system and also our 
Galaxy are expanding at a rate comparable to the order of the Hubble-Lemaitre 
constant Ho (although usually a little bit smaller). From Sect. 7.8 we already know 
that the influence of the cosmological constant A is completely negligible on the 
local Hubble expansion. 

Finally, note that the current location of the American space probe Pioneer 10 
launched in 1972 does not conform to the laws of Newtonian mechanics. This is the 
famous Pioneer anomaly problem [17]. However, it has nothing to do with the local 
Hubble expansion and dark energy, since: 


1. It can be explained by radiation of an asymmetric position of heat sources in the 
probe. 

2. The corresponding time interval is too short compared to cosmological time 
scales. 

3. The additional acceleration has the opposite sign than the local Hubble expan- 
sion. 


9.2 The Universe Expands Almost as Rapidly as the Moon 
Recedes From the Earth 


In this section we show that the Moon moves away from the Earth more rapidly than 
would be gathered from classical mechanics. The hypothesis about tiny repulsive 
forces and the local expansion of the Solar system can be very well tested just by 
means of a precise measurement of the Earth-Moon distance. The Moon’s orbit has 
a small eccentricity (e = 0.0554). Thus the Moon slowly approaches the Earth and 
then moves away during each period. Current technologies allow us to establish 
long-term changes in the parameters of its orbit very precisely. 

Since the seventies of the last century, changes in the mean Earth-Moon distance 


D ® 384,402 km (9.1) 


have been carefully measured by laser retroreflectors installed on the Moon by the 
Apollo missions. 
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Fig. 9.2. Reduction of the angular velocity of the Earth’s rotation is caused by friction when 
creating tidal bulges, which are formed by the gravitational influence of the Moon. Since the Earth 
is not an ideal rigid body, its deformations are converted into heat 


The nearer side of the Earth to the Moon is attracted by a greater force than the 
opposite side. As a result of a nonconstant gravitational potential, tidal forces are 
formed. Note that tidal forces played an important role in the deposit of sediments. 
They provide a record of the lengths of the year and month over time as discovered 
by G.E. Williams [378]. Due to the Earth’s rotation, they give rise not only to ebbs 
and flows of the seas and oceans, but also to the lifting of the Earth’s mantle (see 
Fig. 9.2). Continual deformations of the Earth cause its rotation to slow down, and 
thus its spin (rotational) angular momentum decreases over time. Then from the 
law of conservation of total angular momentum it follows that the orbital angular 
momentum of the Earth-Moon system has to increase. 

Already in 1975, T.C. van Flandern [354] (cf. also [84], [88, Chapt. 6], [232], 
(335, p.155]) noticed that the Moon has a somewhat anomalous orbit, because 
it moves away from the Earth faster than can be explained by tidal forces and 
Newtonian mechanics. He examined whether it could be due to a slight decrease 
in the value of the gravitational constant. Long-term measurements show (see [82]) 
that the mean distance D continually increases on average 


v = 3.84cm per year. (9.2) 


Hence, the trajectory of the Moon is not an ellipse, but a very dense spiral with 
respect to the Earth-Moon center of gravity. However, the action of tidal forces 
on the Earth’s crust, hydrosphere, atmosphere, etc., can explain a much smaller 
expansion velocity — only about 55% of the value given in (9.2), see (9.20) and also 
[264, p. 67], [273, Sec. 9.10.4]. This is usually called the Paradox of tidal forces of 
the Moon, see [354]. Verbunt [365] even writes about the tidal catastrophe. 
Therefore, the Moon is receding from us probably not only due to tidal forces. 
For the remaining increment of almost the same order of about 1.71 cm per year 
we should look for other explanations. For instance, one possibility is that some 
apparent repulsive forces also contribute to the observed recession of the Moon. 
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The current expansion rate of the universe is characterized by the Hubble- 
Lemaitre constant, see Example 9.2 and (7.11), 


Ho ~ 10myear'au~! eoa eT 5, 


It is not difficult to recalculate the speed of this expansion only to the changing 
distance between the Earth and Moon. Denoting year to be the unit of time 
corresponding to one sidereal year and R = 1 au, then by (9.1) we have 


D 
Hy ~ 10myear ‘au! — 10, myear !D7! = 2.57 cm year !D~!. (9.3) 


We see that this value is surprisingly quite close to the mean value (9.2) of the 
recession speed of the Moon from the Earth. This is a remarkable coincidence. At 
the same time, the tiny repulsive forces can explain a larger measured recession 
speed (9.2) than that resulting from the tidal forces. 

Further, we show that from the law of conservation of the total angular 
momentum one can derive a recession speed of the Moon from the Earth due to 
tidal forces of only about 2.13 cm per year (cf. (9.20)) assuming a constant moment 
of inertia of the Earth. First, we will estimate in detail the contribution of tidal forces 
to the speed (9.2) by the method introduced in [160]. 

Consider the isolated binary system Earth-Moon with masses 


m, = 5.97219. 10% kg, mz = 7.3477- 10” kg (9.4) 


and assume, for simplicity, that their orbits are circular (estimation of the tidal 
influence of the Sun on the change in the Earth’s rotation will be discussed later). 
Then the corresponding distance (see (9.1)) can be expressed as follows: 


D=R,+ Ro, (9.5) 


where 


Dm 2 Dm, 
R, = ———— and Ry = ————_ (9.6) 
m, +m? mi +m 


are the distances of the Earth and the Moon from their Newtonian centre of gravity, 
respectively. 

By the conservation of the total angular momentum of this system (i.e. the sum 
of the spin angular momenta and the orbital angular momenta), the value 


J =o, + har +m,R,vy + m2R20v2 (9.7) 
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has to be constant, i.e., its time derivative is zero, 
» dJ 
J=—. 
dt 


Here v; and v2 are the orbital speeds of the Earth and Moon, respectively, relative to 
their center of gravity, [; and /> are the momenta of inertia of the Earth and Moon, 


9: 2. 
1 = — =7.292-105s3, a = — =2.669-10-°s"! (9.8) 
Ti Ty 


are the angular frequencies of the Earth’s and Moon’s rotations about their axes, 
T; = 86,164.1 s is the sidereal day, and 7, = 27.32166 7;. By BurSa and Peé [53] 
we have 


I, = (8.036 + 0.008) - 10°” kg m?. (9.9) 


We shall examine the sizes of the terms J;@; and J@2 appearing in the equation 
J = 0 (see (9.14) and (9.17) below). Consider the following three steps: 


1. Using the formula for the momentum of inertia of a homogeneous ball [296, 
p. 109], we find for the momentum of inertia of the Moon (whose density p(r) 
increases towards its center) the inequality 


4 2 
bh < Pui) -p2- 513 = 8.87: 10°+ kgm’, (9.10) 


where 
p2 = 3348 kg/m? (9.11) 


denotes the mean density of the Moon and r2 = 1737km is its radius. Note 
that the term J)@2 < 2.37 - 10°?kgm/?s~! corresponding to the Moon is much 
smaller than [j@, = 5.86 - 10°3 kg m’s—!. However, we have to compare their 
time derivatives. 

According to [315], the Earth’s rotation has slowed down so much during the 
last 2700 years that the length of the day increased on average about 1.7 ms per 
century, i.e. 


T, = 1.7- 10 sper year. (9.12) 


Estimations of the size of these long-term changes in Earth’s rotation caused 
by tidal forces were obtained by a thorough analysis of ancient Babylonian (see 
e.g. [264, p. 62], [315, p.270]), Arab et al. [273] records concerning the angular 
heights of the Sun during observed eclipses. This is in agreement with today’s 
precise radio measurements of the Earth’s rotation slow-down using distant 


9.2 The Universe Expands Almost as Rapidly as the Moon Recedes From the... 189 


quasars. Contemporary increased melting of glaciers, internal processes in the 
Earth, mass transfer in the atmosphere, hydrosphere, etc., cannot explain such 
a large value as given in (9.12). For simplicity, we will first assume that the 
momentum of inertia of the Earth /; is independent of time. In the next section 
we shall allow the time dependence /; = [ (ft). 

The Sun and Moon have practically the same angular diameter. Since tidal 
forces decrease with the third power of the distance from the Earth and the 
volume increases also with the third power, the tidal influence of each of the 
two bodies on the Earth is directly proportional to its mean density. Since 
Po = 1409 kg/m? due to (1.13), we find by (9.11) that the ratio of the densities 
of the Moon and Sun is about 3348 : 1409 = 2.38. The lunar tidal effect on the 
Earth is thus larger than the solar tidal effect. The actual ratio is slightly smaller 
than 2.38, since the mean angular size of the Sun is 31.98’, whereas for the Moon 
it is 31.07’. 

According to [282], the Earth’s rotation slows down about 68.5% due to the 
influence of the Moon’s tidal forces, and 31.5% from those of the Sun. Hence, 
the increase 7; = 0.6857 corresponds to the Moon and 0.3157 to the Sun. After 
1 year the angular frequency of the Earth’s rotation changes as follows 


20 


ee 9.13 
N+ we) 


@| 
From relation (9.12) and (9.13) we see that the corresponding time change of the 
Earth’s angular frequency is! 


pe ai 
T T ™(N +71) 


where 
T = 31,558,149.54s 


is the sidereal year. By (9.9) the corresponding change of the spin angular 
momentum of the Earth is equal to 


Ia, = —2.509 - 10! kg m? s~?. (9.14) 


2. The Moon also reduces its angular frequency about its rotational axis due to the 
recession speed (9.2) and the 1:1 resonance between the Moon’s rotation and 
the orbital period 7> of the Moon around the Earth. The Moon has fallen into the 
so-called tidal trap. 


'The correct calculation of the derivative @ requires the use of the measured value (9.12) 
of Earth’s rotation slowdown, and not a theoretically derived value 7 using the conservation law 
of the total angular momentum and relation (9.2) as done in [264, p. 65]. 


190 9 Dark Energy and the Local Hubble Expansion 


Now, we show that /2a@ is several orders of magnitude smaller than the value 
given in (9.14). Taking into account (9.5), we may apply Kepler’s third law which 
approximates the actual situation on short time scales quite well. By (1.10) the 
ratio D?/ ie is constant. Hence, the product w;D? is also constant by (9.8). 
Differentiating w;D? with respect to time, we obtain the differential equation 


20207 D> + 3w3D?D = 0. 
Hence, 


a. 3 D ith D_ 3.84em year! _ 6 gg 10-8ey-! (9.15) 
OSD NN" D 384,402km ne ee 


By (9.2) the mean long-term observed time change of D is given by 


3.84cm 
T 


Dictisceves = =1.2- 10~° m/s. (9.16) 


From this, (9.1), (9.8), (9.10), and (9.15) we obtain 
|In@| < 1.1-10®kgm?s~?. (9.17) 


Hence, the decrease of the Moon’s spin angular momentum is negligible in 
comparison to the value given in (9.14). 


. The decrease of the spin angular momentum of the Earth in (9.14) has to be 
compensated by the increase of the orbital angular momentum m Rj vj-+m2R2v2 
in (9.7). We will again employ the |: | resonance between the angular frequency 
of the Moon about its rotational axis and the angular speed of the Earth about 
their common center of gravity, i.e., @2 = v2/R2 = v;/R,. Consider further the 
Earth-Moon system whose center of gravity is at rest. Then by the momentum 
conservation law mv, = m20v2, (9.5), and (9.6), we get 


mRyvy +m2Rov2 = (Ri + Ro)myv, = Dmyv, 


2 m\m2 
= Dm, R\@2 = D* ——a? 
m, +m2 


From this we get by differentiating (9.7) with respect to time and neglecting [2a 
(cf. (9.17)) that 


mim2 d(D?a mim : 
Ko, = mie A 2) = alt (@D? + 2a@2DD) 
my+m2 dt m, + m2 


mim. wD . 
_— D, (9.18) 
mi +m, 2 
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where the last equality follows from (9.15). Substituting from (9.1), (9.4), (9.8), 
and (9.14), we finally obtain 


D = 0.674: 10-? mis, (9.19) 
which is only slightly more than one half of the measured value (9.16). 
Multiplying (9.19) by the length of the sidereal year T, we find that the Earth- 
Moon distance should increase by only about the value 


Utides © 2.13 cm per year. (9.20) 


Denote the difference between the measured value from (9.2) and the value (9.20) 
by 


Uremainder © 1.71 cm per year, (9.21) 
that is, 


VU = UVtides + Uremainder- 


The origin of the term U;emainder is unknown, but its magnitude is only a little bit 
smaller than the current value of the Hubble-Lemaitre constant in (9.3). This 
supports the hypothesis concerning the local influence of dark energy. More 
precisely, the speed 1.71 cm per year from (9.21) equals 67% of the value 2.57 cm 
per year of the Hubble-Lemaitre constant, as given in (9.3), i-e., the local Hubble 
expansion is 


H") — 0.67 Ho. (9.22) 


Such a large recession does not seem to be caused by tidal forces nor other 
nongravitational effects, as we shall see below and in further chapters. 
In 2003, Yurii Dumin (see [84, p. 2463]) derived from astrometric measure- 
t ; _ : (loc) 
ments corrected to ancient eclipses a similar local expansion rate of Hj)” *© 
0.5 Ho for the Earth-Moon system. In his paper [85] from 2008 this value was 
increased to ra = 0.85 Hp for the data from the last three centuries. 
Remark 9.1 By Kokubo et al. [149] our Moon was formed at a distance of only 
d = 20,000km from the Earth approximately 4.5 billion years ago. In this case, 
solar and lunar eclipses would have been much more frequent than at present, since 
the angular diameter of the Moon was about 10° * 2 arcsin(r2/d). The Moon 
was shining extremely brightly at that time, since its area on the celestial sphere 
was 400 times larger than now. Its orbital period was less than 6h by Kepler’s 
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third law (1.10). Moreover, tidal forces were huge. This corresponds to the mean 
recession speed 


v = 8cm per year 


to reach the current distance (9.1). Because tidal forces decrease with the third power 
of the distance, tidal forces played a significant role especially at the beginning. 
However, apparent repulsive forces could slightly but permanently increase the 
recession speed of the Moon from the Earth within the entire 4.5 Gyr long interval. 


9.3. Mars Was Much Closer to the Sun When There Were 
Rivers 


The total solar power incident per unit area of 1 m? perpendicular to the Sun’s rays 
at the distance of 1 au is equal to the solar constant 


Lo = 1.361kWm~”. (9.23) 


The value of Lo currently varies by less than 0.1% depending mainly on the number 
and size of sunspots. Hence, the total solar power is about 


Lo = 4m R*Ly = 3.828 - 107° W, (9.24) 


where R = | au. 

Since the Sun is a star on the main sequence of the Hertzsprung-Russell diagram, 
its position slowly rises to the left upper corner. However, in doing so, the Sun travels 
only a very short distance on the main sequence. According to [314, p. 461], the Sun 
had a surface temperature of 5586 K about 4.5 Gyr ago and its luminosity was only 
70% of the current value (corresponding to + 1 kW/m? at the distance | au from 
the Sun). Its luminosity increased approximately linearly with time up to the present 
(see Fig. 9.3). The current effective surface temperature is 5770 K and after 3 Gyr it 
will rise to © 5843 K, see [314, p.461]. At that time the Sun will already produce 
133% of its current power. Thus we see that the Sun’s surface temperature increases 
relatively slowly, while its power grows much faster. The Sun will become a red 
giant after 12 Gyr from its formation. 

Hydrologists estimated from the number of craters in dry river valleys 
(cf. Fig. 9.4) that Mars had liquid water on its surface 3-4 Gyr ago. At that time the 
luminosity of the Sun was only 75% of its present value. The solar power decreases 
with the square of the distance from the Sun. Therefore, the corresponding solar 
constant for Mars would only be 


150\2 Lo 
LMars = 0.75L0( 5) = 3. (9.25) 
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Fig. 9.3. Relative luminosity L/Lo of the Sun from the origin of the Solar system up to the present. 
The time f is given in Gyr. The bold interval on the time axis indicates the period when Mars had 
liquid water on its surface 


Fig. 9.4 Dry riverbeds on Mars tending to the ancient sea at the bottom right. The center of the 
image (whose dimensions are 175 x 125 km?) is at 42.3° south Martian latitude and 92.7° west 
longitude (photo NASA) 


provided that Mars had been on average 
r = 225-10°m (9.26) 


away from the Sun as it is now. In fact, the luminosity (9.25) corresponds to a 67%- 
decrease of the solar constant Lo given in (9.23). In this case the existence of rivers 
on Mars would be impossible. 

Note that a decrease of Lo by only 2% causes ice ages on the Earth, even 
though there is the greenhouse effect. An ancient atmosphere on Mars 3-4 Gyr 
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ago had one-third to two-thirds of the surface atmospheric pressure that Earth has 
today (see [124]). Higher concentration of CO2 (as suggested by Bertotti et al. [28, 
p. 177]) surely contributed to a higher surface temperature on Mars, but cannot fully 
explain liquid water there because of the huge 67%-decrease of the luminosity given 
in (9.25). Therefore, Mars must have been much closer to the Sun to account for 
liquid water. 

There is a possibility that Mars had escaped to freezing in the past due to 
a strong greenhouse effect, which could have compensated for the lower solar 
luminosity (9.25). However, the absence of carbonates in lake sediments according 
to analysis performed by the Curiosity Rover in Gale Crater of Mars [48] is 
casting serious doubts on this interpretation. For a rapid degassing of the Martian 
atmosphere during the first 400 million years we refer to [318, p. 16]. 

Mars also had a larger albedo than now, since there were water clouds feeding 
many lakes and hundreds of large rivers whose dry riverbeds are now between 
—50° and 50° of Martian latitude (see Fig.9.4). Due to measurements of the 
missions Viking I and II, Pathfinder, Spirit, Opportunity, Curiosity, Mars Orbiter, 
Perseverance, etc., we know that the current annual average temperature (about 
—63°C neglecting the greenhouse effect) on Mars is very much below the freezing 
point of water. 

By the Stefan-Boltzmann law (see [184, p. 173]), the equilibrium temperature Teq 
at the distance r from the Sun satisfies oT ey = Lo/(ar’), where Lo = 3.846 - 


1076 W is the present value of the total Solar luminosity and o = 5.669 - 10-8 
Wm ?K~‘ is the Stefan-Boltzmann constant. Since Mars’ surface area is four times 
larger than the area of its maximal cross-section, the current overall average surface 
temperature can be estimated by the Stefan-Boltzmann law as follows: 


1/(1— A)Lo\ 1/4 
Twas = 5(“—*"°) = 211K, (9.27) 


wor 


where A = 0.25 is the present value of the Bond albedo. We see that Tmars is really 
very close to the yearly planet-wide mean measured temperature ~ —60°C. When 
the Sun’s luminosity was 75% of the present value (see Fig. 9.3), we get from (9.27) 
that Tvars = 196K only. Note that if the temperature of water is 273.16 K and the 
pressure 611.7 Pa, then water can exist simultaneously in gaseous, liquid, and solid 
state. This is called a triple point (see Fig. 9.5). As a consequence we must conclude 
that Mars has been moving away from the Sun with an average speed of at least 
several meters per year (cf. (9.3)). 

When the Sun’s luminosity was only 0.75Lo, then similarly we would get r = 
116.782 - 10° km to reach the freezing point of water TMars = 273.15 K. However, 
this distance is more than 100 million km smaller than the current radius r = 225 - 
10° km. The infrared emissivity does not change these values too much. 

The above arguments show that it has no sense to develop some artificially 
complicated climate models for Mars and that Mars must have been closer to the 
Sun by several tens of million km when it had liquid water (Fig. 9.6). 
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Fig. 9.5 The triple point in the phase diagram of water. The average temperature and pressure on 
the Earth and Mars is indicated by a small bullet. Boxes show approximate ranges of temperature 
and pressure that can be achieved on the Earth and Mars. The vertical axis has the logarithmic scale 


Fig. 9.6 Pictures of sediments in the Gale crater taken by the Curiosity mission represent a further 
proof of liquid water on Mars during a long time period. An open problem is why all layers have 
almost the same thickness and which physical mechanism could produce such a perfect periodicity 
(photo NASA) 


By (9.3) recalculated to the Mars-Sun distance (9.26), we find that Mars 
could have moved further from the Sun by an amount of about 60 = 4- 10- 
225/150 million km during the last 4 Gyr. 
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Finally note that already in 1979, Maeder and Bouvier [230, p.88] studied 
the weak field consequences of scale invariance of gravitation and suggested the 
possibility that the recession speed of Mars from the Sun should be several meters 
per year (see also [229]). 

The associated average recession speed of cca 10 m/year (cf. (9.3)) of Mars from 
the Sun could be verified as soon as laser retroreflectors are installed on the Mars or 
Phobos. 


9.4 Orbital Expansion of Titan 


Recently it was found from two independent sets of astrometric and radiometric 
Cassini data that the mean recession speed of Titan from Saturn is 


v = 11.3 cm/year. (9.28) 


Before these measurements, astronomers assumed that this speed was only 
0.1 cm/year, because Saturn is mostly a gaseous planet and thus tidal forces are 
small. 

Lainey et al. [210] claim that this rapid recession speed (9.28) is mainly caused 
by a resonance locking mechanism of the five inner mid-sized moons of Saturn: 
Mimas, Enceladus, Tethys, Dione, and Rhea. Nevertheless, their total mass is only 
3 % of the mass of Titan. So if the unexplained recession speed of Titan is about 
0.1 m/year, the inner moons should approach Saturn by an amount that is on average 
approximately equal to more than 3 m/year (i.e., 3,000,000 km/Gyr) in order to 
keep the laws of conservation of energy and momentum valid. This would however 
contradict the present sizes of their orbits whose semimajor axes are between 
185,539-527,108 km. Moreover, the old cratered surfaces on the inner moons (see 
[259]) contradict their implied young ages. 

Therefore, in [181] another explanation is given. It is again based on tiny 
repulsive forces which could be due to gravitational aberration, cf. [160], [176], 
(231, p.25]. Since the Titan-Saturn average distance is d = 1,221,870km, the 
value (9.28) results in 


d 11.3 cm/year 8-4] 
= ~0.925-1 2 
Z iim ee) 


and by means of (9.3) the recalibrated value of the Hubble-Lemaitre constant is 
Ho = 8.15 cm year !d7!, 
Therefore, the corresponding expansion rate is 


11.3 

(loc) 

H = — Ap = 1.38 Ao. 9.30 
0 3150 0 (9.30) 
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This value, of course, includes tides which have to be subtracted as in the case of our 
Moon, see (9.22). Moreover, resonances with mid-sized inner moons should also be 
taken into account. 


9.5 Expansion of the Solar System 


In Sects. 10.3 and 10.4 we will present several arguments supporting the hypothesis 
that repulsive forces also significantly contribute to the increase of the mean Earth- 
Sun distance by several meters per year. Below we present several other arguments 
indicating the expansion of the Solar system. 


Remark 9.2 The rotation of Mercury is very slow (59 days) which could be caused 
by a very large impact in the very early stage of its development. However, Mercury 
has no tectonic activity that would renew its 4.5 billion years old surface which is 
now uniformly dotted only with relatively small craters. Another possibility is that 
its slow rotation is due to tiny repulsive forces. We hypothesize that Mercury was 
once closer to the Sun, and then due to these forces it slowly moved to a higher orbit 
with a semimajor axis of about 57.9 million km. 

Because tidal forces are decreasing with the cube of the distance from the Sun, 
they act on Mercury by a (149.6/57.9)? ~ 17 times larger value per unit mass than 
on the Earth. Moreover, if Mercury were e.g. only 40 million km from the Sun at 
the time of its creation, which is in accordance with (9.3), then the tidal forces from 
the Sun would be (57.9/40)* ~ 3 times greater than today. Altogether, we get a 
3-17 = 51 times larger tidal influence per unit mass than that acting on the Earth 
at present. This would significantly slow the rotation of Mercury, which has about a 
100 times smaller moment of inertia than the Earth. 

Mercury could also be closer to the Sun due to the absence of its moons whose 
orbits would be unstable there. 


Remark 9.3 There is also an open problem concerning how Neptune could be 
formed as far away as 30 au from the Sun, where the original protoplanetary disc 
was relatively sparse and all motions would be very slow (by Kepler’s third law 
its mean speed is only 5.4km/s). Assuming that the average Neptune-Sun distance 
increases roughly at the rate of Ho, we find by (9.3) that Neptune could have been 
formed many astronomical units closer to the Sun 4.5 Gyr ago. 


Remark 9.4 There is a well-known puzzle, how the large moon Triton with a diam- 
eter of 2705 km was captured so far from Neptune. Triton is a retrograde moon, 
since it orbits in the direction opposite to the direction in which Neptune rotates. 
Thus the spin angular momentum of Neptune has the opposite sign than the orbital 
angular momentum of the Neptune-Triton system. Hence, Triton should continually 
approach Neptune to keep the law of conservation of angular momentum. Triton 
has probably orbited Neptune for a very long time, because its present generally 
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recognized orbital eccentricity is incredibly small, namely, 
e = 0.000,016. (9.31) 


This is the smallest eccentricity of all known bodies in the Solar system. When 
Triton was captured during some N-body collision, its orbit was almost certainly an 
elongated ellipse and it took billions of years for Triton to reach an almost circular 
orbit due to tides. Its present radius a = 354,759 km is very large. 

Using (9.31), we find that 


V1 —e2 = V1 — 256- 10-12 = 0.999,999,999,872. 


From this and (1.2) we get for the difference of the semimajor and semiminor axes 
that 


a—b=a(l—V1 —e?) = 354,759 km x 0.000,000,000, 128 = 4.5 cm. 


Is it really possible that (9.31) would be some measured value without any error 
estimate? 

We claim that Triton was captured closer and then migrated from Neptune, since 
the apparent repulsive force are bigger than tidal forces that act in opposite direction. 
This could explain the current very large orbital angular momentum of Triton. 


Remark 9.5 Similar arguments can be applied to the Pluto-Charon system. Since 
this system is completely locked into the 1:1:1 resonance, tidal forces are almost 
negligible and its large orbital angular momentum can again be due to tiny repulsive 
forces. 


Remark 9.6 We know 11 rapidly-moving satellites that orbit Uranus below its 
stationary orbit with radius 82,684km. The average difference between radii of 
their neighboring orbits is only 2663 km. The distances of such neighbors above 
the stationary orbit are much bigger (cf. Fig.9.7). The orbit of the satellite Puck 
has a radius of 86,010km, followed by Mab with 97,700km and Miranda with 
129,390 km. This is because for satellites below the stationary orbit, the tidal forces 
and apparent repulsive forces are mutually subtracted, while for the satellites above 
the stationary orbit they are summed. A similar statement holds also for rapidly- 
moving satellites of Neptune. For more details concerning satellites below and 
above the stationary orbit of Uranus and Neptune, see [184, p. 226]. 


In [184] we present several other examples suggesting that gradual expansion can 
also be observed in other smaller subsystems of the Solar system. This apparently 
contradicts the classical Newtonian law of conservation of energy. 
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Fig. 9.7 Rapidly-moving satellites of Uranus. Below the stationary orbit the tidal forces and 
apparent repulsive forces are subtracted, since they have opposite directions, whereas above this 
orbit they are summed up coherently. The distances between neighboring satellites above the 
stationary orbit are substantially greater than under it 


Emmy Noether was a leading German mathematician who proved the following 
theorem [263], which completely changed the face of physics: 


Theorem 9.1 (Noether) The energy of each isolated system is conserved if it 
possesses symmetry with respect to time translations. 


If the trajectories of the moons of planets in the Solar system develop in time 
along spirals, then this may indicate the existence of apparent repulsive forces that 
would violate the law of conservation of energy. The symmetry with respect to 
time translations is not true for spiraling trajectories. Similarly, another theorem of 
Noether states that spatial symmetry under rotation, or rotational invariance, leads 
to the conservation of angular momentum. This symmetry is also slightly violated 
in our case. Finally, she also proved that spatial translation symmetry ensures that 
momentum is conserved. 


9.6 Gravithermal Catastrophe 


Almost all open and globular clusters seem to be unstable, as was shown by 
Pavel Kroupa [204]. Smaller stars tend to move away from the center of a cluster, 
while the massive ones gradually accumulate around the center. This leads to mass 
segregation. During multiple collisions light-mass stars are sometimes even ejected 
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Fig. 9.8 Globular cluster M13 in the constellation Hercules (photo NASA) 


from the cluster which is usually called evaporation of stars in the clusters. The 
surrounding, heavier stars drift closer to the center due to loss of potential energy. 
This process ends by the so-called gravithermal catastrophe (see [115]). 

However, the local Hubble expansion should cause all the stars on average to 
move away from the center. Hence, it would act against the gravithermal catastrophe 
inside the cluster and it would slow the entire process. 

There exist about 150 globular clusters in the Milky Way, but some other galaxies 
contain thousands of globular clusters. They are very old systems and contain 
hundreds of thousands or up to millions of stars, see Fig. 9.8. Some globular clusters 
have existed for more than 13 billion years and their gravithermal catastrophe has 
not been fully manifested, yet. 


9.7 Do Single Galaxies Expand? 


There is no reason to assume that apparent repulsive forces would somehow not 
be present in the interior of galaxies, since their manifestations are observed not 
only at large cosmological distances, but also inside the Solar system as shown in 
previous sections, see also [335, p. 154]. In spite of that it is generally claimed that 
galaxies do not expand, because they are gravitationally bound and that only the 
space between them expands. However, galaxies are usually included in clusters 
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that should also not expand, because they are gravitationally bound as well. Galaxy 
clusters are gravitationally bound again in superclusters, cf. [262]. So where does 
the universe expand? 

According to [235, Sect. 8], the observed conservative expansion rate of the 
Milky Way is 0.6—1 kpc/Gyr, which is approximately 0.6—-1 km/s, since 1 kpc 
= 3.086 - 10!©km and 1 Gyr = 3.156 - 10!° s. This expansion rate nicely fits to 
the Hubble-Lemaitre constant recalculated by (6.31) on the radius 


1 
R= 50,000 ly = 15,328 pe = 70 Mpc 
of our Galaxy, namely, 
Ho © 1km/(s R) 


and 


R_ 1kpce/G 
ee =e. 0 at 
R_ 15,328 pe 


This value is again comparable with similar ratios (9.15) and (9.29) which include 
effects of tidal forces. 

Many other astronomical observations document the expansion of galaxies 
themselves. For instance, Bouwens et al. [39] found that galaxies grow slowly. 
Also Ignacio Trujillo with his team [350] discovered that the size of massive 
galaxies increases with time. This increase can, of course, be partially explained by 
intergalactic gas and dust that falls into galaxies due to gravity, as well as galactic 
cannibalism. However, galaxies at cosmological distances have on average more 
stars per unit volume when compared with the present situation, i.e., this is in line 
with the proposed local Hubble expansion on galactic scales. 

By Ferré-Mateu and Trujillo [102] superdense galaxies were quite common in 
the early universe with redshift z > 1.5. At present they are quite sparse. Papers 
[357, 362, 376] also suggest that early galaxies were smaller and denser just after 
their formation. According to [51], the mass density of some galaxies for z > | is 
even comparable with the density of globular clusters themselves, i.e., on average 
several stars per pe?. On the basis of this enumeration (see also [75, 119, 349], 
etc.) we may suppose that repulsive forces essentially contributed to the above- 
mentioned expansion and thus led the interiors of the galaxies themselves to become 
more tenuous. Similarly as in the Solar system, the rate of expansion of galaxies 
themselves may be smaller than the Hubble expansion, but it may also have the 
same order. This would support the picture that galaxies “swell up” very slowly like 
sea sponges or bread rising in the oven. 

Further, we introduce a geometrical argument from [184, p. 243] which is based 
on proof by contradiction. Suppose for a moment that galaxies have constant volume 
(i.e., they do not expand with time) and that they do not collide. The right part of 
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Fig. 9.9 A schematic illustration of nongrowing galaxies of constant size over time in an 
expanding universe. The unit cube is on the left. It contains several galaxies in our neighborhood 
for the redshift z = 0. The distribution of galaxies at cosmological distance z = 2 is in the middle 
and for z = 4 on the right. Such a picture of tightly crowded galaxies has not been observed by 
astronomers 


Fig. 9.9 illustrates what we would observe at cosmological distances, if galaxies 
would have the same size at all times and no new galaxies were formed. On the 
left, five galaxies are schematically depicted in the unit cube for z = 0. Thus for 
z = 2 there would be on average 5(z + 1)? = 5 - 3° = 135 crowded galaxies in the 
unit cube, since the middle cube from Fig. 9.9 can be placed into the left unit cube 
33 = 27 times. Analogously we find that for z = 4 there would be 5 - 5? = 625 
tightly packed galaxies in the unit cube. They could touch or even penetrate each 
other, if their sizes would be constant st all times. However, such a tight arrangement 
is not observed in various Hubble Deep Fields, since galaxies were smaller at that 
time. 

We may also argue in the following way: The volume of our Galaxy is about 
10° x 10° x 10° ly?. The total volume of all 10!* observed galaxies can thus be 
approximated as 107° ly?. Hence, if the radius of this part of the universe was smaller 
than 10° ly and protogalaxies were to have the same size as today’s galaxies, they 
would necessarily interpenetrate each other, which would lead to a contradiction. 

The above arguments show that galaxies themselves grow (see Fig.9.10) 
although probably by a somewhat lower rate than the Hubble parameter. The 
repulsive forces (sometimes called dark forces) slightly, but continually generate 
energy in any system of free bodies that interact gravitationally [198, p. 276]. They 
increase its total (i.e. kinetic + potential) energy [265]. These forces thus act locally 
in our Galaxy and in other star systems. 
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Fig. 9.10 A two-dimensional model S? ,) of an expanding universe with comoving and swelling 
galaxies. On the other hand, in [78, 79], and [249, p.719] galaxies have constant sizes over time. 
The center of this “inflating rubber balloon” corresponds to the Big Bang at an initial time 


Remark 9.7 The angular momentum of spiral galaxies is also not conserved, see 
e.g. [217]. This is naturally expressed by the galactic angular momentum paradox: 

How is it possible that spiral galaxies (originating from small random fluctua- 
tions in a hot homogeneous and isotropic universe) rotate so fast? 

Danail Obreschkow even writes about the angular momentum catastrophe, see 
[265, p.4]. How do galaxies gain their extremely large angular momentum? The 
answer that comes easily reads: From the local Hubble expansion. The kinetic 
energy hidden in each of the individual instances of the billions of rotating spiral 
galaxies is enormous. Local as well as global expansion increases the total potential 
energy. This is similar to the increase in potential energy of a comet moving away 
from the Sun. 


Remark 9.8 By Hulse and Taylor [136] the orbital period of close binary pulsars is 
shortened. In this case, the binary system creates a strong and rapidly changing 
gravitational field in a curved spacetime. According to the GTR, the system 
loses energy in the form of gravitational waves. Since pulsars have extremely 
strong magnetic fields, the total energy can partly decrease also in the form 
of electromagnetic waves. These phenomena predominate over the local Hubble 
expansion. 

Finally, note that the gravitational potential behaves like C,r~! while the 
magnetic potential like Cyr. Therefore, for sufficiently small r we have 


Ci OG 
—< 
r r 


This implies that magnetic forces play a crucial role in very close encounters 
between binary pulsars. 


Chapter 10 Mm) 
Anthropic Principle and the cree | 
Hubble-Lemaitre Constant 


10.1. Weak Formulation of the Anthropic Principle 


According to the weak formulation of the anthropic principle, all fundamental 
physical constants have just such values that enabled the origin of life, see e.g. 
[26, 57, 60, 61, 184]. This hypothesis is, of course, untestable, because we cannot 
change values of these constants and observe how the universe would evolve 
afterwards. 

The strong formulation of the anthropic principle postulates that evolution 
necessarily leads to the origin of humans (= Anthropos in Greek). However, we may 
immediately raise the following objection against this formulation. If the asteroid 
that caused the extinction of the dinosaurs had missed the Earth, then the human 
race would never have developed, even though it cannot be denied that there would 
possibly be other intelligent beings. Note that there exist also other definitions of 
the strong and weak anthropic principles. Nevertheless, it is very surprising that all 
fundamental physical constants have exactly such values that allow the birth and 
development of life in the universe. 

In this chapter, we demonstrate that also the current value of the Hubble-Lemaitre 
constant essentially contributed to the existence of humankind, even though it is 
not a fundamental constant [202]. Life on Earth has existed continually for at least 
3.5 Gyr and this requires very stable conditions during this quite long time interval. 
Nevertheless, due to the local Hubble expansion and since the luminosity of the 
Sun increases, the Earth receded from the Sun by an appropriate speed such that it 
received an almost constant solar flux during the last 3.5 Gyr. 
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Remark 10.1 No physical constant should be considered as a standard mathemati- 
cal constant. For instance, the irrational numbers 


mw = 3.1415926535... (Ludolf number), 
e = 2.7182818284... (Euler number), 


V2 = 1.4142135623... 
have infinitely many digits. On the other hand, the Newton gravitational constant 


G = 6.674 + 0.001 - 107!! m3 kg™!s~?. 


is not a real number with an infinite decimal expansion. Its fourth significant 
digit is close to four, but the other digits are not known. In the future it will be 
impossible to find e.g. one thousand digits of G, since physical constants have 
a completely different character than real numbers. Physical constants should be 
rather treated as “fuzzy numbers” or “interval arithmetic numbers” or the “density 
of some probabilistic distribution function”. The reason is a special character of all 
physical measurements and the Heisenberg uncertainty principle which prevents us 
from obtaining infinitely many true decimal digits. 


Anyway, the value of the gravitational constant G has a significant influence 
on the interior temperature of a star, its age and size, its luminosity, and many 
other parameters. The reason is that the product GM is proportional to the pressure 
inside the star, where M is its mass. If G were to be only one part per thousand 
smaller or larger than its present value, then all stars, galaxies and also their clusters 
would evolve in a different way, and thus the Earth could not come into being as 
it is. Unfortunately, we cannot perform any experiments to test the evolution of the 
universe for a different value of the gravitational constant G. 

Similar statements are valid also for other fundamental physical constants like the 
speed of light c in a vacuum,! the mass of the proton Mp, the proton-to-electron mass 
ratio Mp/me © 1836.152673, the elementary charge e (of the electron), Planck’s 
constant h, the vacuum electric permittivity ¢9, the Avogadro constant Na, the 
Boltzmann constant kp, etc. On the other hand, the dimensionless constant of fine 
structure a = e?/(2eghc) x 1/137, the reduced Planck constant A = h/(27), 
the vacuum magnetic permeability constant 49 = 1/(eoc7), etc., should not be 
called fundamental, since the right-hand sides of their definition expressions already 
contain fundamental constants. Another way of proceeding would be to use a 
different (but equivalent) set of fundamental constants, where we replace e.g. 9 
by a. 


' We can always choose physical units so that c = 1, G = 1, e9 = 1, etc., instead of the standard 
SI units. 
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Some physical constants that characterize, for instance, ordinary water also have 
remarkable values. It is well-known that water has its highest density at 4°C under 
normal pressure. As a result, in winter, lakes do not freeze from the bottom. Contrary 
to other substances, ice has a lower density than liquid water. Thus, ice floats on 
water, which protects the fish and other aquatic animals from freezing. Water also 
has the highest heat capacity of all substances which significantly contributes to a 
high stability of the temperature of Earth’s surface. Nevertheless, particular values of 
these physical constants are only side effects of the fundamental physical constants. 
Hence, the values of c, mp, e, h, €0, etc., are so perfectly tuned that they allow water 
to have such advantageous properties suitable for the existence of life. 

The structure of DNA with covalent bonds in both its strands and with hydrogen 
bonds between adenine (A) and thymine (T), and cytosine (C) and guanine (G) 
essentially depends on the interaction diameters of elementary particles and also on 
the size of the elementary charge e. This value plays a crucial role in replication 
and transcription processes [186]. The translation process: RNA — PROTEINS in 
ribosomes depends on e, too. Other fundamental constants allow a proper forming 
of the helix shape of DNA as well. Note that each cell has about 2 m of DNA strands 
which do not tangle during replication and transcription. Notice further that the total 
length of DNA in each human is about 10!? m which is more than the Earth-Pluto 
distance. 

The two hydrogen bonds between adenine and thymine (see Fig. 10.1) are 
contained in the so-called TATA-box which precedes the start codon. Therefore, this 
box can be more easily split by the enzyme RNA polymerase during transcriptions 
than the three hydrogen bonds between cytosine and guanine, cf. [368, p. 966]. 
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Fig. 10.1 Schematic illustration of the chemical structure of DNA: atoms of carbon are marked 
by large black dots, atoms of hydrogen by small black dots, covalent bonds are shown as solid 
lines, and hydrogen bonds as dotted lines. At both edges there is a solid skeleton of sugar 
phosphates, which protects the genetic information from damage. In these edges the deoxyribose 
sugar alternates with the phosphate group PO4 
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Assigning A= 00, C= 01, G= 10, and T= 11, we observe that genetic 
information can thus be directly transcribed into binary digital system that was 
actually first discovered by nature 3.5 Gyr ago, i.e. much earlier than humans [186]. 

It is also noteworthy that the main components of all organic compounds are the 
most frequent elements in the universe (except for the inert gas helium). The most 
frequent element is, of course, hydrogen H, oxygen O is third, carbon C fourth, 
and nitrogen N fifth. They have small nuclei which supports a large variability of 
organic molecules. The nucleus of Si is too large in comparison with C to form large 
complex molecular structures. 

A suitable combination of the above-mentioned fundamental constants also 
guarantees an incredible stability of the Sun for about 10Gyr. By the formula 
E = mc?, the Sun’s mass losses due to nuclear reactions are 4.26 x 10° kg/s. Since 
Mo = 1.989 x 10°° kg, we find by (9.24) that 


Lo 
— = 0.0002 W/kg 
Mo 


on average. (For comparison, the human body produces about | W/kg.) Conse- 
quently, nuclear reactions inside the Sun are incredibly well tuned so that the Sun 
was shining a very long time (~ 4.6 Gyr) which allowed the development of life 
and then the origin of humankind. 

This process in the central part of the Sun can be very roughly described as 
follows. The weak interaction between two colliding protons of the same spin 
followed by the strong interaction is responsible for the formation of a deuteron. 
However, this collision is extremely unlikely, since the binding energy of one 
deuteron is only ~ 1.1 MeV per nucleon. After that the strong interaction between 
two deuterons causes the formation of helium nuclei (i.e. alpha particles). Their 
binding energy is much larger, namely, ~ 7.1 MeV per nucleon. 


Remark 10.2 The expansion speed of the universe is also a very important quantity 
in the context of the anthropic principle. If it would be too high, galaxies would 
not appear. If it would be too small, our universe would gravitationally collapse and 
life would not have enough time to arise. The current expansion speed essentially 
depends on the initial conditions during the Big Bang. 


There are many different definitions of living organisms. Max Tegmark [344, 
Chapt. 1] reduces the definition of life to a process that can retain its complexity 
and replicate itself (i.e. a self-replicating information-processing system). It is 
astonishing that the origin of life on Earth occurs at almost the same time as the end 
of late heavy bombardment of the Earth by asteroids ~ 3.8 Gyr ago (see Fig. 9.3). 
Note also that life probably did not arise instantaneously, but it could very slowly 
evolve over thousands or millions of years in water. During this period one cannot 
uniquely state if some cluster of molecules is alive or not. We also cannot rule out 
an extraterrestrial origin of life. 

Let us emphasize that Ho represents only some average value over large 
structures. The repulsive forces contribute to the expansion of habitable zones (as in 
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the case of the Earth), while the luminosity of the mother star is gradually increasing. 
In this sense, the habitable zones are then more stable, since they may exist for a 
longer time period. 


10.2. The Faint Young Sun Paradox 


Let us first recall what is meant by the faint young Sun paradox. Assume for a 
moment that the Earth was at its birth about 1 au away from the Sun as it is now. 
In order to not be completely frozen and to be able to develop life on its surface, 
the Sun at its origin had to be approximately as hot as it is now. However, this is 
not in agreement with the fact that the Sun as a star on the main sequence of the 
Hertzsprung-Russell diagram had a lower luminosity at its origin (cf. Fig. 9.3). This 
leads to the paradox usually referred as the faint young Sun paradox [213]. The 
mean temperature on the surface of Earth would have been much below the freezing 
point of water, in contrast with the absence of glaciation in the first 2.7 Gyr of its 
existence (see [28, p. 177]). 

The faint young Sun paradox is, in fact, more severe due to ice-albedo feedback 
of the frozen ocean. To prevent the Earth from freezing over, a much higher 
concentration of CO than today is assumed in [145]. This oxide is at present 
deposited in CaCO3. There is evidence supporting the hypothesis that the Earth 
during its history was a “snowball” several times (see [114, p.58]) but, on the other 
hand, sometimes the temperature of the oceans was much higher than today [224]. 
Of course, the composition of the atmosphere and also the greenhouse effect played 
a significant role. 


Remark 10.3 Tf Mars were much closer to the Sun by tens of millions of kilometers 
(see Sect. 9.3), then our Earth would also have to be closer to the Sun. Otherwise 
these planets could have close encounters and their orbits would not be stable over 
time. 


In the following sections we present further independent arguments suggesting 
that the Earth was much closer to the Sun in the past and that their average distance 
from each other increases by a speed of several meters per year. Assuming that this 
is due to the repulsive forces, the faint young Sun paradox ceases to be a mystery 
[86]. 


10.3. The Expansion of the Ecosphere 


The Earth is located inside the so-called habitable zone, i.e., the area in which liquid 
water is present permanently. In this section and mainly in the next Sect. 10.4 we 
estimate such an expansion speed of the Earth from the Sun to ensure a constant 
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Fig. 10.2 Schematic 
illustration of the expansion 
of the ecosphere during the 
last 3.5 Gyr, where 

R(to) = 1.3-10!! m, 

R(O) = 1.5- 10!! m, and 


Ro) 
to = —3.5 Gyr ae 


R(O) 


solar energy flux for 3.5 billion years, which would provide favorable conditions for 
the development of life on our planet. 

So far, unfortunately, we cannot measure the actual average expansion speed of 
the Earth from the Sun with an accuracy on the order of a meter per year, because 
the position of the center of gravity of the Solar system shifts every day by about 
1000 km primarily as a result of the gravitational influence of the giant planets, see 
Fig. 4.4. Therefore, we will consider the very long time interval of 3.5 billion years 
of life on the Earth. 

Note that the Earth’s elliptic orbit has a very small eccentricity e = 0.0167. The 
semimajor axis a = 149.598 - 10°km and the semiminor axis b = avV/1 — e? = 
149.577 - 10° km do not differ before the fifth significant digit, i.e., Earth’s orbit is 
almost circular. 

To ensure favorable conditions for life on Earth it is necessary at present that 
the Sun’s luminosity be at most 5 % larger or smaller than the solar constant Lo 
given by (9.23). Such a ring (resp. spherical layer) is called an ecosphere. Since 
the energy flux from the Sun decreases with the square of the distance, its radii 
are (0.95) !/ > au and ( 1.05)!/ ? au, which corresponds to a very narrow interval of 
145.8—153.3 million km (see Fig. 10.2). If the Earth’s elliptic orbit were to leave this 
ring for a long time period, it would have disastrous consequences for life on our 
planet. A permanent reduction of the Sun’s luminosity of more than 5 % would cause 
overall glaciation of the planet. On the other hand, at temperatures over 57°C DNA 
sequences of mammals decay. Even at temperatures above 50°C mitochondria stop 
producing adenosine triphosphate (ATP) that supplies chemical energy to eukaryote 
cells. 

When life appeared on Earth (i.e. 3.5 billion years ago), the Sun’s luminosity was 
about 77 % of the present value (see Fig. 9.3). To ensure a favorable climate for the 
long-term evolution of life, where liquid water is necessary, the Earth was probably 
tens of millions of kilometers closer to the Sun at that time. This claim is supported, 
for example, by data on the occurrence of fossil thermophilic bacteria, from which 
it is believed [224] that the temperature of oceans was about 80°C three and half 
Gyr ago. Due to higher volcanic activity at that time it is, however, not clear to what 
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extent this may only be a selection effect. On the other hand, formula (9.27) applied 
to the Earth yields a quite large discrepancy in temperatures, namely, for given a 
total solar luminosity of 0.77Lo, r = | au, and a present value for the mean Earth’s 
albedo of A = 0.306, we get the mean temperature —35°C. 

Heat produced by decay of radioactive isotopes also contributed to a higher 
temperature during the first few hundred million years of the Earth’s existence, 
when elements with a relatively short half-life decayed. According to current 
measurements, the heat flow from the Earth is less than 0.1 W/m2. It could not 
have been much larger 4Gyr ago, because the half-lives of the contemporary 
natural radioactive isotopes 232TH 238, 40K, and 255U are 13.9, 4.468, 1.248, 
and 0.704 Gyr, respectively [77]. It is therefore estimated (see [114, p.58]) that the 
heat flow from the Earth 4Gyr ago was no more than 5 times larger than today. 
Nevertheless, this heat flow (mainly from diverse geothermal sources) is completely 
negligible when compared to the solar constant (9.23). 

Although the Sun’s power grew approximately as illustrated in Fig. 9.3 the Earth 
should have relatively stable conditions that are needed for the development of life 
during the period lasting 3.5 Gyr. 


10.4 Two-Sided Error Estimates 


The faint young Sun paradox can well be explained by the tiny repulsive forces that 
are responsible for the local Hubble expansion, see [161]. First we show that for the 
constant average speed of the Earth moving away from the Sun, cf. (9.3), 


v = 5.2m/year, (10.1) 


the Earth would receive an almost constant density of flux of energy. In detail, the 
velocity UV is optimal in the sense that the Earth receives practically the same flux of 
energy from the Sun as the solar constant Lo (defined by equality (9.23)) over a very 
long time period of 3.5 billion years. Namely, the optimal flux Lop: will correspond 
to the particular expansion rate (10.1). 

In order to prove this (see Theorem 10.1 below), we put 


to = —3.5 Gyr 


and t = O will again correspond to the present time. The main results of this 
section are formulated in the form of mathematical theorems to make it clear what 
is assumed and what is claimed. The symbol Vt means “for all ?’”. 

Since the luminosity of the Sun increases approximately linearly with time and 
it was only about 77 % of its present value 3.5 Gyr ago (see Fig. 9.3), we set 


L(t) = (1 = 0.23--) Lo Vt € [to, 0], (10.2) 
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i.e. L(to) = 0.77 Lo and L(O) = Lo. As the luminosity decreases with the square of 
the distance, we can state the following quite surprising theorem. 


Theorem 10.1 (Optimal Recession Speed of the Earth from the Sun) Set 


ee La (10.3) 
opt — (R + 0t)2’ 0 ’ . 
where R = 1 au and Vv is given by (10.1). Then 
|Lopt(t) — Lo| < 0.005 kW m7 Yt € [fg, OJ. (10.4) 


Proof The very small dispersion of luminosity +0.005 kW m* on the right-hand 
side of formula (10.4) can be easily derived analytically by investigating the rational 
function Lop(t). It is concave on the whole interval [fo, 0], By inspection we find 
that its minimum is attained in the left end point to, Lopt(to) = 1.357387... and 
the maximum of the function Lop, is attained in t* ~ —1.7- 10° year, Lopt(t*) = 
1.364574... Hence, the following two-sided estimate holds (cf. Fig. 10.3): 


1.355 < Lopt(to) = min Lopr(t) < max Lopr(t) < 1.365 
te[to,0] te[to,0] 


and Lop (0) = Lo. oO 


We do not claim that the Earth recedes from the Sun just by the speed (10.1), but 
with a speed whose size has order comparable with v. The luminosity flux 


1.36+0.005kWm? Yt € [to, 0] 


would, of course, guarantee very stable conditions (10.4) for the development of 
intelligent life on Earth over the very long period of 3.5 Gyr. In particular, the local 
Hubble expansion seems to be just right for an almost constant influx of solar energy 
and thus also for the appearance of humankind. 
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Fig. 10.3 Plot of the almost constant function t +> Lop(t) over the interval [7o, 0]. The vertical 
axis is substantially shortened for the clarity of the presentation 
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The speed (10.1) corresponds to the mean local Hubble expansion 


Hf? = 0.52 Ho. 


Due to a similarity with the order of the Hubble constant (9.3), we propose that 
the above recession originates from the apparent repulsive forces introduced in the 
previous chapter. These forces thus cause a secular outward migration of our planet 
on the order of meters per year, so that it stays within the expanding ecosphere (see 
Fig. 10.2). If they were not to act, then conditions favorable for the development 
of life on Earth would exist for only about 1 billion years. Intelligent life would 
not have had enough time to develop due to a continual rise in temperatures (see 
Fig. 9.3). 

The local Hubble expansion thus represents further support for the (weak) 
anthropic principle, which states that basic physical constants are favorable to the 
emergence of life only if they are in very narrow intervals [161]. Moreover, the 
speed in (10.1) is optimal in the sense that any other slightly different speed would 
not yield an almost constant flux expressed by the rational function in (10.3) on the 
time interval 3.5 Gyr. Thus it is probable that the actual average recession speed of 
the Earth from the Sun oscillates about the value 5.2 m/year (see Theorems 10.2 
and 10.3), e.g. due to the influence of other planets. 

Now, for a variable continuous recession speed v on the interval [fo, 0] we define 
similarly to (10.3) the associated luminosity 


2 
L(v, t) = ve x, € €[fo, 0], (10.5) 


(R aif v(6)d8 ) 


where tg = —3.5 Gyr, R = 1 au, L(t) is given by (10.2), and the integral of the 
velocity yields the distance. In the case that we are not able to establish a proper 
value of some quantity, the so-called two-sided estimates may be quite useful (see 
e.g. [193, 364). 


Theorem 10.2 (Two-Sided Estimates) [f the recession speed v = v(t) of the Earth 
from the Sun lies in the interval from 4.26 m/year to 6.14 m/year for every t € [to, 0], 
then the luminosity defined by (10.5) changes at most about 5 % from Lo, namely, 


0.95Lo < Liv, t) < 1.05L9 Vt € [to, 0]. 


Proof By inspection we find that for the constant velocity vj) = 4.26 m/year the 
rational function t + L(v1, f) is increasing on [fo, 0] and thus by (10.5), 


L(t) R? L)R* 
0.95Lo = ———]{ _ = L£(u1, to.) < = LY, 10.6 
Ore an? (v1, to) < (R+ ve (v1, 0) (10.6) 
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for any t € [fo, 0]. Analogously, for the constant velocity v2 = 6.14 m/year, we get 
that tf > L(v2, t) is decreasing, and therefore, 


L(tg) R2 


L(v2, t) < L(v2, 9) = ———_,, = 
(v2, t) < L(v2, to) Rene 


1.05Lo. (10.7) 


Putting (10.6) and (10.7) together, we obtain by (10.5) and the proposed estimate 
u(t) € [4.26, 6.14] m/year that for any ¢ € [fo, 0] 


L(t) R? L(t) R? L(t)R? 
0.95Lo < R 426n2 = _ R 6. 1ap2 = 105Lo- 
The proof is completed. oO 


A more important converse proposition has stronger assumptions on the veloci- 
ties: 


Theorem 10.3 (Additional Two-Sided Estimates) [f the average recession speed 
v lies outside the interval [4.26, 6.14] m/year, then there exists a nonempty subin- 
terval I C [to, 0] such that the luminosity L(v, t) is less than 95 % or greater than 
105 % of Lo for allt € I. 


Proof If 0 < vy = 4.26 m/year, then similarly to (10.6) we get 


L (to) R? L(to)R? 
< 
(R+0t)* ~ (R+ v1 I)? 


L(V, to) = = L(v1, to) = 0.95Lo. 


From the continuity of the rational function t +h L(V, f) it follows that there exists 
a nonempty time interval /; such that £(v, t) < 0.95Lo for all t € Ih. 

Analogously to (10.7) we find that for v > v2 = 6.14m/year there exists a 
nonempty interval J, C [fp, 0] such that £(v, t) > 1.05Lo for all t € h. oO 


An average speed v lying outside the interval [4.26, 6.14] m/year corresponds to 
truly inhospitable conditions for the evolution of life into multicellular forms. The 
previous two theorems can also be easily modified for other values than just 5 %. 


Remark 10.4 In some models the linear function L(t) = (1 — 0.23t/to)Lo from 
(10.2) and Fig. 9.3 is replaced by the rational function (see e.g. [28, p. 177]) 


ig@=—= t € [to, 0] 
~ 14+0.3t/t9’ ie 
where t9 = —4.5 Gyr. In this case the optimal average recession speed (guarantee- 


ing an almost constant energy flux from the Sun) is 


v = 4.36 m/year 
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Fig. 10.4 The behavior of the Hubble parameter H = H(t) according to the ACDM model 
is sketched by the solid line. The dashed-dotted line stands for the corresponding deceleration 
parameter g = —1 — H/H? that was evaluated by means of numerical differentiation of the 
function H = H(t). The lower horizontal axis shows time in Gyr since the Big Bang. In the upper 
horizontal axis we see the corresponding redshift z according to [284] 


and the mean recession speed should be in the interval [3.27, 5.21] m/year to keep 
variations of the solar energy flux below 5 % as in Theorems 10.2 and 10.3. 


Remark 10.5 It is said that the universe expands exponentially due to dark energy. 
From Fig. 10.4 we observe that the Hubble parameter H = H(t) has been almost 
constant during the last 4.5 Gyr when the Solar system was formed. It lies in the 
interval [ Ho, 3 Ho). Assuming that H(t) is constant, i.e., H(t) = Ho, we get by 
(6.27) that Theorem 10.1 can be modified for an exponential expansion as follows. 
If the average recession speed of the Earth from the Sun is v = 5.014 m/year, then 
Lop (t) = 1.36 + 0.008 kW m~? for all t € [t9, 0], which is analogous to (10.1) 
and (10.4). Also Theorems 10.2 and 10.3 can correspondingly be slightly modified. 
Moreover, from Fig. 7.6 we observe that the expansion function a = a(t) is almost 
linear during the last 4.5 Gyr. 
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10.5 A Possible Future Development of Earth’s Orbit 


From Theorems 10.2 and 10.3 we find that the probable secular expansion rate of 
the radius of Earth’s orbit lies in the interval 


H\' & [0.426 Ho, 0.614 Hol. 


Such a local Hubble expansion is therefore perfectly tuned by the apparent repulsive 
forces (see [161]). Therefore, not only the fundamental physical constants are finely 
tuned. For example, the average surface temperature of the Sun T = 5770 K is also 
perfectly balanced. If it were to decrease about 1 %, i.e. a mere 57.7°C, then by the 
Stefan-Boltzmann law Lp = oT“, the solar constant Lp would decrease about 4% 
as 0.994 Ly © 0.96Lo. The total luminosity of the Sun would also decrease by 4 %. 


Remark 10.6 Itis often argued that after half a billion years, the water in the oceans 
will evaporate, because the power of the Sun will be too high (see Fig. 9.3). These 
simplified conclusions do not take into account that the Earth can be inhabited by a 
technically advanced civilization, which can, for instance, place a giant reflection 
sheet at the Lagrangian point L; to control the incoming flux of variable solar 
energy. According to [314, p.461], the Sun’s luminosity will increase to 1.33 Lo 
after 3 Gyr. Thus, it may exceed 1.05 Lo over half a billion years. However, the 
recession speed (10.1) would guarantee very stable conditions on the Earth for 
several Gyr in the future. For instance, in the next 3.5 Gyr the flux density of solar 
energy will be in the very favorable interval 1.33-1.36kW m7’, if the luminosity 
would evolve as in (10.3). The function t +> Lopt(t) is in fact decreasing in the 
interval [0, |fo|] and from relations (10.2) and (10.3) it follows immediately that 


Lopt(Itol) = 1.33kW m~. 


Thus there would be pretty good prospects into the far future. Life on the Earth 
would have had a chance to develop further for a long time period in very stable 
conditions. 


After 5—7 billion years, when reserves of hydrogen in the central region of the 
Sun will run out, the Sun will begin to change to a red giant. Its radius will extend 
beyond the current orbit of Venus. At that time the Earth could be about 180 million 
miles away from the Sun, provided the receding velocity is (10.1). Hence, the tiny 
repulsive forces could keep the Earth sufficiently far away from the growing Sun. 
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10.6 Why Does the Law of Conservation of Energy Not Hold 
in the Physical Universe? 


Einstein’s equations are assembled so that the law of conservation of energy and 
momentum is theoretically valid absolutely exactly. The reason is that the covariant 
divergence of the left-hand and right-hand side of (3.52) is zero. Newtonian mechan- 
ics is also designed in such a way that this law holds exactly. Nevertheless, in this 
and previous chapter we saw that the local Hubble expansion and the corresponding 
energy non-conservation may explain a number of classical paradoxes in the Solar 
system, in particular: 


1. The faint young Sun paradox. 

2. The paradox of the large orbital angular momenta of the Moon, Triton, and 
Charon. 

3. The formation of Uranus and Neptune. 

4. The existence of rivers on Mars and the existence of its satellite Phobos. 

5. The tidal catastrophe of the Moon. 

6. The paradox of the large recession speed of Titan from Saturn. 

7. The tidal locking of the moon Iapetus. 

8. The slow rotation of Mercury and the absence of its moons. 

9. The long existence of rapidly-moving satellites below the stationary orbit of 
Uranus and Neptune. 

10. The long existence of life on Earth. 


Thus we claim that Einstein’s equations satisfying the law of conservation of 
energy and momentum cannot well describe the evolution of the Solar system, 
although no one knows their solution. The main reason it that the energy and 
momentum in the Solar system are not conserved. 


Remark 10.7 The effect of local expansion of the universe has a long history dating 
back to the 1933 paper [240] by Cunliffe George McVittie. By Carrera and Giulini 
[58], Cooperstock et al. [70], and Mashhoon et al. [236] the cosmological constant 
A has no effect on the expansion of the Solar system. However, from Sects. 7.8, 9.2, 
9.3, 10.4, etc., we already know that we can detect the local Hubble expansion even 
in the Solar system. 


The Solar system but also galaxies are unique astrophysical laboratories for 
testing whether or not the law of conservation of energy and momentum holds 
and whether a finite speed of propagation of the gravitational interaction generates 
as a byproduct the sought energy for general expansion (local and global). To 
illustrate this hidden energy source consider two bodies A and B of equal masses 
that orbit symmetrically with respect to their center of gravity. If A attracts B and 
B also attracts A at their instantaneous positions, then by the Newtonian theory of 
gravitation these forces are in the same line and in balance. 

However, the speed of gravitational interaction is finite. Therefore, the body B is 
attracted by A towards its previous position A’ as depicted in Fig. 10.5. Similarly A 
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Fig. 10.5 Trajectories of two 

interacting bodies of equal 

masses form a double spiral. 

The distance between two 

neighboring arms is actually 

much smaller than in the 

picture. The gravitational 

aberration angles ABA’ and 

BAB are extremely small A’ 


but positive 
A 


is attracted by B towards its previous position B’. Then a couple of non-equilibrium 
forces arises that permanently acts on this binary system. It increases the total 
angular momentum and the total energy of the system as well (see [184]). So this 
system is gradually expanding as a result of apparent repulsive forces. 

In the framework of General Relativity Steven Carlip in [55] derived that the 
gravitational aberration angle y of a body with speed v is bounded from above by 
the fraction v*/c? while the angle of light aberration is approximately equal to 


Vv 
a=. 
Cc 


However, in doing so, he assumes that 


(a) the cosmological constant is zero, 
(b) some nonlinear terms, which he cannot estimate, are vanishing, 
(c) the laws of energy and momentum conservation hold exactly. 


Therefore, of course, he cannot obtain spiraling trajectories as schematically 
depicted in Fig. 10.5. Moreover, since the gravitational interaction has the same 
speed of propagation as light, the aberration angles a and y should be the same 
for weak gravitational fields. 

The recession speed and the angle of gravitational aberration depend on masses, 
velocities, and positions (also in the past) of all free bodies that gravitationally 
interact among themselves [157]. Numerical simulations show that any N-body 
system with N > 1 expands on average for a finite speed of gravity. Gravitational 
aberration thus has a nonnegligible influence on the expansion rate of the universe. 
Note that our Earth is not expanding, because it is not a system of free bodies that 
gravitationally interact. 


10.6 Why Does the Law of Conservation of Energy Not Hold in the Physical... 219 


Space deformations by the two interacting bodies may also have a non-negligible 
influence on the local Hubble expansion. Their proper distance is larger than 
their coordinate distance (see Fig. 5.1) and time dilation should also be taken into 
account. 

Recall (see e.g. [5, 6, 18, 136]) that there exist close binary pulsars whose orbital 
period, by contrast, decreases over time. In this case, the system loses kinetic energy 
due to the radiation of gravitational and electromagnetic waves, due to tides, friction 
by interstellar dust, and so on. The resulting forces then prevail over apparent 
repulsive forces. 

Sometimes we must also take into account various resonances that may cause 
much larger effects than these apparent repulsive forces whose manifestations on 
short time scales are indeed negligible. However, they operate continually in any 
gravitationally bound system of asteroids, moons, planets, stars, galaxies, clusters of 
galaxies, or orbiting clusters of galaxies. They contribute to migration of planets and 
their moons over long-term time periods, they cause that star clusters continually 
“evaporate”, they act against gravithermal catastrophe of galactic clusters and star 
clusters, they reduce the frequency of collisions of stars [157] and galaxies, they 
slowly expand the “cosmic web”, they stabilize the Solar system, and so on. 
Apparent repulsive forces also helped to create the conditions suitable for the 
emergence of life on Earth over a period of several billion years, during which solar 
power was growing. 


Chapter 11 ®) 
Gravitational Waves Sees 


11.1 The First Detection of Gravitational Waves 


In this chapter we show that masses of binary black hole mergers are overestimated, 
since a large gravitational redshift is not taken into account (see (11.7) and (11.9) 
below) which is a direct consequence of time dilation in a strong gravitational field. 
This fact allows us to explain a high mass gap between observed binary neutron 
stars and calculated binary black hole mergers. We also present other reasons taken 
from [199] why masses of black hole mergers are determined incorrectly. 

In 2016, the letter [2] about the first detection of gravitational waves was 
published. They were generated by two merging black holes that had approximately 
36 and 29 solar masses, the final black hole had 62 solar masses and 3 solar masses 
turned into gravitational waves. Now we will analyze this event in detail. 

In 1916 Gust a century before [2] has appeared), Albert Einstein [93, 95] 
predicted the existence of gravitational waves. He assumed only very weak gravita- 
tional fields which enabled him to linearize his field equations of General Relativity. 
He considered only small perturbations of Minkowski spacetime and after some 
further simplifications he got a nonhomogeneous partial differential equation with 
the d’Alembert operator for plane gravitational waves (see e.g. [32, p. 24]). 

Gravitational waves were first detected on September 14, 2015. According to [2], 
two black holes with masses 


m, =3613Mo and mz = 29*4Mo, (11.1) 


merged and the generated gravitational waves GW150914 were independently 
intercepted by two LIGO detectors. 

We show that the masses in (11.1) are not too trustworthy. First of all, we would 
like to emphasize that our criticism does not concern the LIGO detectors themselves, 
which are highly sophisticated and remarkable instruments [23]. It concerns the 
methodology that was used to process and then interpret the measured data. The 
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Fig. 11.1 Depiction of the 
passage of a planar 
gravitational wave through 
two detectors LIGO in USA 


Hanford 


applied post-Newtonian model assumes an infinite speed of gravity and neglects the 
infinite gravitational redshift of each single black hole. Therefore, it can only very 
roughly approximate reality. It is true that it works pretty well in the Solar system. 
But since this post-Newtonian model is situated in Euclidean space, it does not 
allow one to consider a highly curved spacetime near black holes. Hence, from such 
a heuristic model one cannot establish any credible conclusions about the actual 
masses of these black holes and derive any reliable error estimates. 


Remark 11.1 Two LIGO detectors whose distance is d = 3002km recorded the 
first passage of a gravitational wave. The time delay between these two detections 
was At = 6.9 ms (see [2, p.2]). This immediately leads to the following upper 
bound on the speed of gravitational waves 


dsina d 
Cg = < — = 435,072km/s 
At At 


in a vacuum or in the Earth body, see Fig. 11.1. We observe that this guaranteed 
upper bound is larger than the speed of light in a vacuum c = 299,792,458 m/s. 
However, since 2017 we know that cg = c with very high accuracy due to 
observations of a binary neutron star merger, see [5, 6]. The corresponding 
gravitational waves GW170817 were detected on Earth practically at the same time 
as electromagnetic waves. Note that the relation cg = c was discussed by Henri 
Poincaré [294, p. 1507] already in 1905. Anyway, it is surprising that Einstein’s 
equations (3.52) describing gravity do not contain the speed of gravitational 
interaction cg, but the speed of electromagnetic interaction c. 


The signal from LIGO and VIRGO detectors of gravitational waves is continually 
corrupted by white noise due to thermal movement of particles, seismic waves, 
quantum noise, etc. It is noteworthy that such noise does not seem to be present 
in Fig. | of [2] in a 40 ms long time interval around the maximum amplitude of 
the detected signal. The white noise can be partially suppressed from the detected 
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signal by means of a wavelet transform which removes components with a small 
amplitude and high frequencies at the same time (see e.g. [76, 242]). 


11.2 Emitted Versus Detected Frequencies 


The only relation which is given on a single line in [2] reads: 


—  oamyP PP 5 gs. 33 
ag mes alae f /| ; an?) 


where f = f(t) and f= 7 (t) are the frequency of gravitational waves in time f 
and its time derivative, m, and m2 are the masses of the components of the binary 
system for low frequencies, M is the chirp mass in the detector frame, and G = 
6.674 - 10-!! m3 kg~!s~? is the gravitational constant. The chirp mass determines 
the frequency evolution of the gravitational waves during a binary’s inspiral phase. 

According to [2, p.3], f is the detected frequency. However, then the formula 
(11.2) cannot be true, in general, since f on its right-hand side essentially depends 
on the total redshift z, whereas the left-hand side of (11.2) is independent of z. 
The masses m, and m2 cannot depend on z. Therefore, (11.2) can be valid only for 
z = 0. Note that formula (11.2) is based on the paper [33] published in 1995. 
However, in [33, p.3516] and [74, p. 2663], the authors correctly consider the 
emitted frequency fem (or the orbital frequency 2 fem) in the source frame and not 
the detected frequency f as in [2]. Therefore, the detected frequency f should be 
replaced by the emitted frequency 


fem = (2+ DF, (11.3) 


where z is the total redshift of gravitational waves. 

The masses m, and mz were established by a series of numerical simulations 
[254] with various input values m, and m2. The numerical methods employed in 
constructing the templates for the merging black holes can also be found in [63] and 
[281]. Let us emphasize that no gravitational redshift is taken into account in the 
cited papers [63, 254, 281] on numerical simulations. 

Note also that the corresponding modeling error cannot be established, since the 
true analytical solution of Einstein’s equation for two orbiting bodies with positive 
masses is not known. Moreover, from [2] one cannot find the method by which the 
redshift z was estimated and how the derivative Fi from (11.2) was calculated. There 
is also no relevant reference to this issue. 

Now let us look at the equality (11.2) in more detail. This equality was derived 
from the relation (3) of the 1995 paper [33] in a manner such that several higher 
order terms were neglected. In spite of that, the authors of [2] kept the equality sign 
in (11.2). 
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Example 11.1 Notice that the left-hand side of equality (11.2) does not depend 
on time t, while the right-hand side is time dependent. Thus the equality (11.2) 
is satisfied if and only if the product f—!!/3 f is a suitable constant C (e.g. 
C = 0.00015142... s°/> for (11.1)). This leads to the solution of an ordinary 
differential equation of the first order 


jou (11.4) 
whose general solution is 
3\ 3/8 1 
th= ; 11.5 
FO (3) (K — cr)3/8 “ 


where K € (—oo, 00) is an arbitrary integration constant, see also [74, p. 2663]. By 
Rektorys et al. [297, p. 14] the function (11.5) is the only solution of the differential 
Eq. (11.4) fort < K/C. Since this equation is autonomous, we may choose K = 0, 
i.e. t < 0. Then relation (11.5) determines how the frequency increases with time. 


The authors of [2] had only approximate values of the detected frequency f from 
the interval 35-250 Hz for about 8 “measured orbital periods”. Finally note that 
a numerical computation of the derivatives from smoothed inexact data is an ill- 
conditioned problem which may produce incorrect results (see e.g. [323, p. 144]). A 
detailed analysis of the smoothed signal GW150914 can be found in [394, 395]. 

For the total mass M = m, + mz and the reduced mass 4 = m\m2/M we set 
M, = wM?!>. We also set M2 = m(m/M)!/>, where m = ./m mz is the 
geometric mean. The next theorem states that these formulae represent equivalent 
definitions of the chirp mass. 


Theorem 11.1 We have M = M, = Mp2. 
Proof From (11.2) we find that 


(mym2)*/° mim \3/5 
~ (my +m) 5 sree, (my +m)? = WPM? = My 
and 
(mim2)3/> (mimz)!/10 mi(m)" M 
= SC mim =m — . 
(my + mz)!/5 On +m) 5 M 


The proof is completed. Oo 
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11.3. Neglected Gravitational Redshift 


Unfortunately, in [2] (also in [3, 4, 7, 33, 148]) there is no mention about gravi- 
tational redshift of gravitational waves. Let us recall that redshift (or blueshift) of 
frequency of waves coming to us from the universe has three basic components: 


1. A Doppler component caused by the movement of the source or the observer 
with respect to its neighborhood. 

2. A cosmological component caused by the expansion of the universe. 

3. A gravitational component caused by the change of frequency of waves in a 
gravitational field. 


According to [2, p.7], the luminosity distance of the considered binary system 
is ALOT en Mpc, which is in perfect agreement with the cosmological redshift (see 
e.g. [284, 393]) 


= +0.03 
z= 0.0975 po; (11.6) 
that is stated by the authors. Thus for the two remaining components of the redshift 
it remains 


z0. (11.7) 


Note that the corresponding Doppler redshift cannot be reliably determined, 
because the orientation of the orbital plane and the local movement of the binary 
merger are not known. 

Let us also note that the gravitational redshift for the surface of a neutron star is 
z © 0.3 which is greater than in formula (11.6), which can be derived from (11.10) 
below. For the horizon of a single black hole with mass m with Schwarzschild radius 


2G 
a (11.8) 
c 
we even have 
Z= OO. (11.9) 


Comparing this with the relation (11.7), we find that the authors of [2] did not 
consider the large gravitational redshift caused by the binary black hole system. 

The gravitational redshift is a direct consequence of Einstein’s time dilatation. 
Time in a gravitational potential hole flows more slowly than outside. A photon has 
to spend some energy to leave a gravitational field of a mass object. Its frequency is 
indirectly proportional to the speed of the passing of time. Therefore, the frequency 
of electromagnetic waves decreases when leaving a large gravitational potential hole 
of the binary system. A similar phenomenon holds for gravitational waves that carry 
away energy and thus their frequency will decrease as well. 
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So let us recall the well-known formula (2.22) that can be derived from the 
Schwarzschild solution of Einstein’s equations. It expresses the change of frequency 
of a photon leaving the gravitational field of a single black hole at the distance R > r 


from its center 
" 
f = fem,/1— x (11.10) 


where r is given by (11.8), fem is the emitted frequency of a photon and f/f is the 
detected frequency by a distant observer. From this and (11.3) we obtain the limiting 
relation (11.9) for R > r. 


Remark 11.2 Relations (11.8) and (11.10) can also be used to derive that the 
detected frequency is negligibly changed by the gravitational field at the measure- 
ment site, such as Earth, since r = 9 mm. 


Example 11.2 Setting for instance R = 2r in (11.10) (cf. [2, p.3]), we find that 
f =27'/? fom. By (11.3) the corresponding gravitational redshift is 


z=V2-1=0.414 (11.11) 


which is much larger than the observed cosmological redshift in (11.6). Similarly, 
for R = 3r, 4r, and 5r, we get z = 0.225, 0.155, and 0.118, respectively, which are 
also larger than (11.6). 


Just before the collision of the two black holes, the spacetime between them 
exhibited the largest deformations. By the measured data of LIGO detectors it 
produced gravitational waves with an increasing recorded frequency 35-250 Hz. 
The distance between both the black holes was only a few Schwarzschild radii 
[2, p.3]. From (11.8)-(11.11) we may deduce that the gravitational redshift of the 
emitted gravitational waves will be quite essential and larger than that in (11.6). 
Namely, a common gravitational potential hole of the two black holes is deeper than 
that of each of its components (see Fig. 11.2). 


Example 11.3 From equality (11.2) itis obvious that the sought masses of the black 
holes and also the constant C of (11.4) depend on the emitted frequency fem = 
(z + 1) f. Consequently, a proper determination of the total redshift z is essential. 
By relation (11.3) we obtain an = (z+ 1)? f , where the additional factor (z + 1) is 
due to Einstein’s time dilatation. Substituting this and (11.3) into (11.2), we get the 
missing factor 


(z+), (11.12) 


because 


(neyy” = «+ D/( a (11.13) 
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Fig. 11.2 Schematic illustration of the gravitational potential of a binary black hole system. The 
largest deformation of the spacetime arises close to the central saddle point. According to [2, p. 3] 
the highest amplitude of the detected signal was reached for the separation R = 3r 


and because the remaining factors in (11.2) are constants. In [2] only the cosmolog- 
ical redshift (11.6) was considered, but the total redshift is larger. For instance, from 
(11.6) and (11.11) we may deduce that the total redshift z = 0.09 + 0.414 could be 
slightly larger than 5 Hence, the masses (11.1) were overestimated. 


Example 11.4 Suppose for simplicity that 
m, =m. 
Then from (11.2) we find that the corresponding chirp mass 
M = m9? /(m + m)!/5 = 27'S 


linearly depends on my. 


11.4 Other Arguments 


The mechanism of the origin of the binary black hole system (11.1) is not known. 
According to the survey paper [71, Fig. 8], all known X-ray binaries detected in our 
Galaxy have components with masses less than 1OMo. According to [49, 62, 256], 
masses of all known single stellar mass black holes are in the interval 5Mo-20Mo 
(e.g. Cygnus X-1 has about 20Mo@ and its accretion disc produces radio waves due 
to gravitational redshift). Hence, from a statistical point of view a system of two 
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much larger black holes such as (11.1) seems to be quite exceptional, even though 
some selection effects may be present, since larger masses imply stronger signals. 


Remark 11.3 By Abbott et al. [7] the mass of the larger component of the binary 
black hole corresponding to the event GW190521 was 85 Mo and the total mass 
after coalescence was 150 Mo. The evolution path of such a large black hole binary 
is unknown. It is clear that many of its parameters in a common envelope of gas had 
to be tuned very finely. 


Remark 11.4 There is a large statistically significant mass gap between all known 
black hole mergers and binary neutron stars, and also between single stellar black 
holes and binary black hole mergers, see e.g. [8, 343]. This also indicates that the 
gravitational redshift was ignored. Note that magnetic fields of a binary neutron star 
merger significantly contribute to the final coalescence, since they are very strong at 
short distances r (like permanent bar magnets). From Remark 9.7 we know that the 
magnetic potential is proportional to r~? while the gravitational potential behaves 
like r~!. The corresponding attractive forces are proportional to the gradient of these 
potentials. 


Remark 11.5 Chen et al. in [64] have also found that the important gravitational 
redshift of GW150914 was not accounted for. They assume that the black hole 
binary system was located in close vicinity of a supermassive black hole possessing 
a large gravitational redshift. Their results indicate that the mass of each component 
of such a pair of black holes is not greater than 1OMo. 


Remark 11.6 The wavelength 4 = c/f corresponding to the highest detected 
frequency f = 250 Hz is equal to A = 1200 km. This is a much larger value than the 
diameter of the wave zone given by the associated Schwarzschild radii of the black 
holes. Nevertheless, we observe that the emitted frequency fem > f would yield 
a much more reliable size of the wave zone. This also shows that the gravitational 
redshift was missing. 


Remark 11.7 According to (11.10), a distant observer will theoretically see that the 
fall of any mass object into a single black hole will take an infinitely long time. 
Actually, he should not see the plunging of this object into the black hole due to 
(11.9). On the other hand, a measured collision of the black holes (11.1) lasted 
only 40 ms. Is not that a contradiction between theory and reality? This comparison 
again shows that we should never equate simple theoretical models with the physical 
world. Moreover, since the number of emitted photons is finite, the observer will see 
that the fall will last only a finite time. 


Remark 11.8 The smoothed signals presented from both the detectors show rela- 
tively small third and fourth amplitudes with high frequency appearing after the 
maximal amplitude, see [2, p.3]. However, in principle wavelet analysis does not 
allow one to detect such small amplitudes with high frequencies in noisy data. The 
question is whether the noise removed from the signals (the so-called residual) from 
both the detectors is correlated or not. It appeared that it is uncorrelated except 
for that particular 40 ms long interval, where no noise seems to be present, see 
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Fig. 11.3. This naive illustration implies that the orbital velocity v of binary black holes is larger 
than the speed of light c, see (11.14). It shows only a dipole character of gravitational waves and 
not their proclaimed quadrupole character 


(73, 225, 258]. In this time interval, the removed noise from both the detectors is 
correlated. It was enough to shift the noise from the detector in Livingston about 
7 ms similarly as in the detected signal. The corresponding figure can be found in 
[147]. In [2] such an important comparison is missing. The removed noise from the 
other two detected gravitational waves GW151226 and GW170104 is correlated as 
well. Therefore, such correlated noise could cause a bias in orbital frequencies of 
black holes such as (11.1). 


Remark 11.9 For the time being we do not know what is the speed of gravitational 
waves for other media than vacuum (e.g. inside the Sun). Does this speed depend on 
the frequency as the speed of electromagnetic waves? 


Remark 11.10 The famous Fig. 11.3, which should illustrate the propagation of 
gravitational waves, contradicts the General Theory of Relativity, see [201]. To see 
this, denote by d the distance of the two black holes and by T their orbital period. 
Multiply the trivial inequality 


r>2 


by d/T. Then we immediately get a contradiction 


md 2d |AB| 
> = =¢; 
Go | T 


v= (11.14) 
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where the orbital velocity v is larger than the speed of gravitational waves c (equal to 
the speed of light) and | A B| is the distance of two consecutive maximum amplitudes 
of the right black hole as indicated in Fig. 11.3. However, vu < 5c by Abbott et al. 
[2]. 


Remark 11.11 Recall that the Archimedean spiral is by definition a locus corre- 
sponding to the locations over time of a point moving away from a fixed point 
with a constant speed along a line that rotates with constant angular velocity (see 
[296, p. 136]). The distance between individual revolutions of this spiral in radial 
direction is constant. Another objection to Fig. 11.3 is that the gravitational waves 
should approximately form a double Archimedean spiral. Such a spiral should leave 
each black hole in an almost radial direction and not in the tangential direction as 
illustrated in the center of Fig. 11.3. Moreover, it is easy to see that Fig. 11.3 shows 
only a dipole and not quadrupole character of gravitational waves, in which the 
density of waves would be doubled. Also the exposed spacetime near the center 
that produced the largest amplitudes of gravitational waves, should be much more 
largely deformed. 


11.5 Conclusions 


A comprehensive and detailed study of gravitational waves and their role in 
astrophysics and cosmology is given in [233]. Our hypothesis that masses of binary 
hole mergers are overestimated is based on the following arguments: 


1. The key formula (11.2) possesses several essential drawbacks which are 
described in Sect. 11.2. 

2. In the literature on gravitational waves, no gravitational redshift is taken into 
account while the redshift on horizon of each black hole is infinity, compare 
(11.7) with (11.9). 

3. One should carefully distinguish between an observed and calculated stellar 
black hole. As of 2022, the mass of the largest observed stellar black hole 
Cygnus X-1 is only about 21 Mo, whereas it is 150 Mo for the largest 
calculated stellar black hole from gravitational waves, see [7]. 

4. No mechanism is known which would produce binary stellar black holes with 
masses greater than 50 solar masses, see Remark 11.3. 

5. There is a large and statistically significant mass gap between all known black 
hole mergers and binary neutron stars, see Remark 11.4. 

6. There is another large statistically significant mass gap between observed single 
stellar black holes and calculated binary black holes, see Remark 11.4. 

7. Our hypothesis yields a more reliable size of the wave zone than in [2], see 
Remark 11.6. 

8. Very strong gravitational fields around binary black hole mergers were 
described and calculated by a post-Newtonian model and not by Einstein’s 
equations. 
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Fig. 11.4 A schematic 
illustration of gravitational 
interaction between two 
galaxies G and H in an 
expanding universe 


9. Acurved spacetime around physical black hole mergers is ignored. 
10. A black hole on a piece of paper is a completely different object than a physical 
black hole. 


Therefore, something has to be wrong in the current analysis of GW signals even 
though there are surely various selection effects. The claim that two merging black 
holes had masses (11.1) seems to be somewhat too strong. As shown in Sect. 11.2, 
the main reason is that the detected frequency f appearing in (11.2) has to be 
replaced by the emitted frequency fem. In other words, the chirp mass given by 
(11.2) has to be divided by the missing factor (11.12). We hope that this crucial 
mistake will be removed soon and that it will be not repeated in the coming decades. 


Remark 11.12 It is not known whether the speed of gravitational waves is the same 
as the speed of gravitational interaction. Anyway, they are both finite. Consider two 
instants t) < f2 of cosmic time. In Fig. 11.4 we observe the gravitational interaction 
between two galaxies G and H in an expanding universe (cf. Fig. 6.2). By G; we 
denote the galaxy G at time 7; fori = 1,2, and similarly for H;. Thus, the dashed 
line segments G1 G2 and H H2 denote trajectories of G and H in the corresponding 
spacetime, respectively. Gravitational interaction does not act on objects in the past, 
but because of the time delay, it affects the future. 


Chapter 12 ®) 
Possible Distribution of Mass Inside a ml 
Black Hole 


12.1 A Remarkable Coincidence 


It is generally believed that the center of any physical black hole contains a point 
singularity with infinite mass density, see [275, 276, 278]. This seems to be only a 
mathematical idealization, since no physical quantity can attain infinite values due 
to quantum phenomena. Moreover, no point can rotate and black holes have well 
defined spins. 

Here we introduce our own hypothesis from [166]. We suggest that the hidden 
center of any black hole could be occupied by a neutron-like star composed of 
neutrons and/or quark-gluon plasma. So there is no singularity in the center. 

The maximum mass of a neutron star is at most three solar masses (cf. [6]). We 
show that in this case the radius of such a neutron star is approximately equal to its 
Schwarzschild radius (3.10). Adding a small amount of matter to this star, a black 
hole arises. Thus its interior could contain a compact object below the event horizon 
instead of the proposed point singularity. 

The mass of the neutron is (see [269]) 


m = 1.675: 10-*’ kg (12.1) 
and its interaction diameter is given by d = ,/4a/m ~ 1.65- 107! m, where o is 


the effective cross section. Assuming spherical symmetry, the density of the neutron 
is approximately 


= 0.712 - 10!8 kg/m?. (12.2) 


Pn ~ I 
i 3 
gud 


In our Galaxy, more than 2000 neutron stars are already known. By Burgio et 
al. [52] most of their masses fall in a relatively narrow interval from 1.45Mo to 
1.65Mo, where Mo = 2 - 10°°kg is the mass of the Sun. Their mass is usually 
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2 Possible Distribution of Mass Inside a Black Hole 


Fig. 12.1 The maximum 
density of equal balls in the 
three-dimensional space is 
about 74 %. The maximum 
density of (possibly 
deformed) neutrons can be 
even higher 


greater than the Chandrasekhar limit 1.4Mo for white dwarfs. However, rarely there 
exist neutron stars with masses close to Mg (see [216]), since during the collapse of 
a white dwarf onto a neutron star, some amount of matter is thrown away. 

The mass of each neutron star is smaller than the Tolman-Oppenheimer-Volkoff 
(TOV) limit which is approximately (see e.g. [6, 216, 223]) 


M :=3Mo. (12.3) 


As of 2022, the mass of the biggest known neutron stars (PSR JO952-0607) exceeds 
2Mo. Therefore, the above upper limit (12.3) may be further reduced in the future. 
Anyway, as of 2022, IGR J17091-3624 is the smallest known black hole with mass 
3-7 Mo. 

The neutron star mass density is likely not constant and increases towards the 
center. Thus, the mean density of a neutron star should be even higher than (12.2), 
i.e. denser than an atomic nucleus, cf. Fig. 12.1. It is assumed that the central part 
of any neutron star is occupied by a quark-gluon plasma which is a Fermi liquid 
satisfying the Pauli exclusion principle, see Remark 5.7. Therefore, the central quark 
density could be several times higher than (12.2). However, here we should take into 
account also Remark 3.8. Moreover, we should not identify our simple models with 
reality. 

According to [36], hyperons appear in dense matter in the interior of a neutron 
star which yield approximately a double mean density ¢ = 2¢, compared to (12.2). 
Then for 


4 3 
ee Waa 


given by (12.3) we obtain that the largest possible radius of such a neutron (hyperon) 


star is about 
,| 3M 
R= .|}— & 10km. (12.4) 
Arp 


12.2 Possible Distribution of Mass Inside a Black Hole 235 


Fig. 12.2. The lower part of the event horizon of a black hole is represented by a hemisphere. It 
is not made of any material substance. The interior of a black hole larger than 3Mo could contain 
below its event horizon a star-like object composed of dense neutron/quark matter instead of a 
point singularity 


Recalling the definition of the Schwarzschild radius (3.10) 


2GM 
sS=—, 


(12.5) 
(Gs 


we find that S corresponding to M = 3Mo has a very similar size as (12.4), namely 
S=9km. (12.6) 


In Sect. 3.3 we derived finer estimates that take into account that the space inside 
the neutron star is curved and thus it has a larger relativistic volume than Euclidean 
volume of a massless ball with the same circumference (cf. Fig. 3.3). Adding a small 
amount of mass to the TOV limit in (12.3), the body becomes a black hole, but its 
central part below the event horizon could still remain in the state of neutron matter 
and/or quark-gluon plasma (see Fig. 12.2). It does not have to collapse to a point 
singularity (see Fig. 12.3 and [166]) due to the Pauli exclusion principle. However, 
for the time being we do not know whether the Pauli principle could be violated 
for extremely strong gravitational fields. Let us again emphasize that this does not 
follow from Einstein’s equations, since they do not describe quantum phenomena. 


12.2 Possible Distribution of Mass Inside a Black Hole 


It is estimated that there are over 10° black holes in our Galaxy and 10!” in the entire 
observable universe. Let us denote their coordinate Schwarzschild radius (12.5) as 
follows 


—=M. (12.7) 
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Fig. 12.3. Schematic spacetime diagrams of gravitational collapse of a star larger than 3Mo toa 
black hole. The time axis is vertical. The interior contains a point singularity (left) or a neutron-like 
star below the event horizon (right) 


Table 12.1 The : R, (in meters) |p, (in kg/m) R, (in meters) 
Schwarzschild radius R, and  —  Oorm"— 


the mean mass density p, of lkg 10-77 10 6» 107 
various objects. The last Earth | 0.009 10° 100 
column contains radii R, of Mo 3000 102° 6979 
objects with maximum 3Mo 9000 10!8 10,000 
neutron star density p Ser A* | 10!0 10° 1.4- 10° 
M87*_ | 1.5- 10!% 1 108 


for a given mass M,. Then the corresponding mean mass density in the SI units is 


6 


1 
~ Const.; —* © 108° kg/m? (12.8) 
R3 G3 M3 M2 


M e 


Pe © const. 


with some small (cf. (3.28)) dimensionless constants const. and Const. Conse- 
quently, the mean mass density o, of a black hole is indirectly proportional to the 
square of its mass M,, while the mean mass density p = M/V of a usual object is 
directly proportional to its mass M for a fixed volume V. In Table 12.1, we present 
hypothetical values of black hole mean densities of various objects. 

The values in Table 12.1 associated to the central black hole Sgr A* of the 
Milky Way are derived e.g. in [166]. The last line of Table 12.1 corresponds to the 
supermassive black hole M87* in the center the supergiant elliptical galaxy M87 in 
the Virgo cluster. By (12.7) and (12.8) its mean mass density is p. © 1 kg/m?, since 
this black hole is about 1500 more massive than Sgr A*. This mean density is thus 
like the density of air. 


Remark 12.1 It is surprising that rotational axes of both photon rings of M87* 
and Ser A%* are directed to us (see [13]), even though the jets of M87 are directed 
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elsewhere. The angle between the relativistic jets of M87 and the line of sight can 
be determined by the method described in [184, p. 310]. 


The limit values Mo and 3Mo appearing in Table 12.1 were investigated in 
previous sections. From Table 12.1 we further observe that the Earth being a black 
hole would have an incredibly large mean density, 12 orders higher than the density 
of the neutron (12.2). Moreover, 1 kg black hole would have by (12.8) density 
10°° kg/m?. 

Anxieties that in CERN miniature black holes could be produced are unjustified, 
since they would have unrealistic density. Another argument is that maximum 
energy of each proton in the LHC is about 10!*eV, while from the universe 
we receive protons having over 10?9eV and nothing happens. Namely, in our 
neighborhood we do not observe any miniature black holes, even though the surface 
of the Earth and other planets is continually bombarded by particles of cosmic rays 
with extremely high energies. 


12.3. Other Properties 


Remark 12.2 From (12.1)—(12.4) and the above discussion in Sect. 12.1 we find that 
each neutron star contains about 10°’ baryons. 


Remark 12.3 The existence of Hawking’s micro (or mini) black holes [125] due to 
Hawking radiation is very improbable. By (12.8) their mean mass density would 
be much larger than (12.2), see the third column of Table 12.1. In fact, from (12.7) 
and (12.8) we get 


M 3c7Re 361 4 
TR = aGR = 1.6-10 pz ke/m 


Pe 


(taking into account (3.28)). Hence, the mean mass density p, converges to infinity 
as R, — 0. Thus, Hawking evaporation would yield absolutely unrealistic mean 
densities. In fact, Stephen Hawking uses a trivial algebraic relation (12.5), but 
the reality can be quite different. He assumes that black holes are kept together 
by gravity, but does not take into account that the repulsive strong force is 
locally at least 40 orders of magnitude larger than the gravitational force and that 
Einstein’s equations do not consider the strong force. Moreover, the mean density of 
Hawking’s miniature black holes would increase with decreasing gravitation caused 
by radiation, which is absurd. It is more probable that for black holes of masses 
about 3Mo a neutron star could appear after Hawking’s radiation if it exists. 


Remark 12.4 We can conclude that gravity should not hold together objects with 
mass smaller than ~ Mo and densities larger than the density (12.2) of the neutron 
or quark. On the other hand, the interior of black holes with mass larger than 3Mo 
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Fig. 12.4 The jets of a radio source Hercules A are | million light years long (photo NASA) 


could contain below the event horizon a star-like object composed of dense neutron 
or quark matter instead of the proclaimed point singularity. 

If the core of a black hole rotates (cf. Fig. 12.2), then the horizon need not be 
exactly spherically symmetric. Its shape could resemble an ellipsoid or other axially 
symmetric body. The theory says that each black hole is characterized only by three 
parameters: its mass, spin, and electric charge (see [249, p.876] for the no-hair 
theorem). But the reality can be different (see also Remark 3.6). 


Remark 12.5 The jets of the radio source Hercules A in the galaxy 3C 348 are 
£ = 10° ly long (1 ly © 10!° m). Their shape can be roughly approximated by a 
huge double cone with volume V and basis area rr for r = 10° ly (see Fig. 12.4). 
Assuming a ten times larger density p = 10~*> kg/m? of these jets than is the mean 
cosmic matter density 10~*° km/m?, we can roughly estimate their total mass M as 
follows: 


1 
M=pV= pratar = 10-752 10°?(107)” kg m 2-10 kg = 10°Mo. 


This is a consequence of the very strong magnetic field around the central black 
hole. If the magnetic field is due to a central compact body below the horizon (see 
Remark 3.6), it cannot be excluded that some black holes (namely quasars) should 
be rather considered as black & white holes. These B&W holes probably cannot 
be entered from the axis of jets. Note that quasars are found only at cosmological 
distances. Their remnants in our neighborhood are not so active now, since they have 
already used up most of their own energy. 
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12.4 Final Conclusions 


Can mathematics explain the current crisis in cosmology? Our criticism of the 
standard cosmological model is based mainly on a rigorous use of mathematical 
tools and methods. We wanted to draw attention to the errors and mistakes made 
in contemporary cosmology. When reading any sort of cosmological literature, it 
is often not clear what is a definition, what is an assumption, what is a claim, 
what is experimentally verified fact, what is just a nice numerical simulation 
or artificially colored image, and what is reality (cf. also [325]). A number of 
cosmological definitions are vague. In addition, various quantities in the standard 
cosmological model and measured values are denoted by the same letter and without 
any warning they are sometimes arbitrarily interchanged. From heuristically derived 
equations important conclusions are drawn. Measured data is sometimes interpreted 
incorrectly, excessive extrapolations are made, various consequences are not in the 
form of mathematical implications, etc. Intuition can sometimes be misleading and 
lead to incorrect conclusions. It is a pity that there is not closer cooperation between 
mathematicians and cosmologists. Fortunately, if cosmologists are wrong about our 
universe, nothing happens. 

We showed that the computational complexity of Einstein’s equations is so large 
that one cannot calculate trajectories of two (or more) gravitationally interacting 
mass bodies. Their applicability is therefore quite limited. They satisfy the law of 
conservation of energy and momentum exactly. However, we present dozens of 
examples in the physical world showing a slight deviation from this law on short 
time intervals. Of course, we do not claim that the law of conservation of energy 
and momentum should not be applied. We cannot avoid it in many calculations, 
since it makes most situations much easier. 

Revolutionary developments in physics such as Newton’s theory of gravity, 
Special and General Relativity, and quantum mechanics came at a time when some 
researchers found the courage to get out of a scientific rut, and looked at natural 
phenomena and measured data in an entirely new way. The prevailing idea of 
contemporary cosmology is that 68 % of the universe is composed of some exotic 
dark energy, 27 % of an unknown dark matter, and only less than 5 % of normal 
baryonic matter. However, all models (with no exceptions!) which are used to 
describe the evolution of the universe were tested only on much smaller spatial 
and time scales. When they are applied to the entire universe, we necessarily make a 
considerable extrapolation without any guarantee that the result obtained is correct. 
The combination of measured data and model prediction is then interpreted as 
the presence of dark matter and dark energy, i.e., these quantities exist only by 
definition (see [178, p.11]). The goal of this publication was to point out some 
treacherous pitfalls that cosmologists encounter if they identify the results of simple 
mathematical models with physical world. 

While there are hundreds of books on cosmology which more or less repeat the 
same things in a sluggish general consensus, our book carefully analyzes many 
major basic cosmological statements, identifying their weaknesses and sometimes 
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disproving supposedly well-established results. We are well aware that our book will 
receive criticisms from pundits in the main astrophysical stream, since it disputes 
many of the usual considerations. However, it is also healthy for Science to listen to 
some different voices from the mathematical community, since such an exercise can 
only reinforce the validity of the scientific method, and engender more innovative 
and empirically based understandings of our universe. 
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